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HUMAN HOMOLOGUE OF UNC-53 PROTEIN OF C ELEGANS 

The present invention relates to a vertebrate 
5 homologue of UNC-53 protein of C. eleaans and cDNA 
sequences coding for said homologues or functional 
equivalents thereof. The invention also relates to 
processes for identifying compounds which control cell 
behaviour, compounds identified and pharmaceutical 

10 compositions containing them in addition to processes 
and assays for identifying disease states in which 
said gene or protein is dysfunctional. 

The control of cell motility, cell shape and 
directionality of cell outgrowth of axones or other 

15 cell outgrowths is an essential feature in the 

morphogenesis and function of both unicellular and 
multicellular organisms. 

Some cell surface proteins and extra-cellular 
molecules controlling the directionality and potential 

20 of cell migration have been identified, although the 
processes involved are not generally understood. It 
is generally considered that a long-range migration of 
a cell process (also known as a growth cone extension) 
is a stepwise event, whereby prior to and after each 

25 extension there is the formation of a structure at the 
leading edge of the cell. Localised stabilisation of 
the actin cytoskeleton and association with plus end 
regions of microtubules is a general cell biological 
process underlying the choice of directional 

30 extension. 

The present inventors have surprisingly found a 
new human gene/protein belonging to the UNC-53 family 
that binds microtubules and, in particular, the plus- 
end regions of microtubules. 

35 A gene from the free-living nematode 

Caenorhabditis eleaans designated "unc-53" has been 
previously identified and cloned (Abstract, 



WO 99/63080 




PCT/EP99/03848 



- 2 - 

International C. eleaans Meeting, June 1-5 1991, 
Madison, Wisconsin, 58, Bogaert and Goh) . The present 
inventors previously identified UNC-53 protein as a 
signal transducer or signal integrator controlling the 
5 directionality of cell migration and/or cell shape in 
C. eleaans (WO 96/38555) . 

The C. eleaans UNC-53 protein (Ceunc53) and 
previously found human homologues thereof (hs-unc53/l 
and ( hs-unc53/2) were found to encode a signal 

10 transducer or a signal integrator, controlling the 
directionality of a cell migration, cell shape and 
growth extension. Evidence indicates that the 
presently found homologue designated (hs-unc53/3) 
might act as an adapter linking extracellular signals 

15 to the actin cytoskeleton . Firstly hs-unc-53/3 shows 
homology to the cortical actin binding proteins, and 
the Ce-UNC-53 protein has been shown to bind F-actin 
in vitro and leads to actin re-organization in vivo 
when expressed in mammalian cells, leading to an 

20 increased number of filopodia and lammelipodia . 

Furthermore, increased neurite extension and increased 
cell motility could be observed. Hs-UNC-53-3 may play 
an important role in the development of various 
diseases . 

25 According to a first aspect of the present 

invention there is provided a vertebrate protein 
homologue of an UNC-53 protein of C. eleaans , which 
protein comprises an amino acid sequence having one or 
more of sequence blocks A, B, C, D, E, F, G or H as 

30 illustrated in figure 4 or which differs from said 
blocks in conservative amino acid changes. 

According to a further aspect of the present 
invention, there is provided a vertebrate protein 
homologue of UNC-53 protein of C. eleaans or a 

35 functional equivalent, derivative or bioprecursor 

thereof, having an amino acid sequence encoded by the 
nucleotide sequence illustrated in figure 1 (e) . 
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For the purposes of the present invention a 
"derivative" should be taken to mean mutational 
derivatives, fusions, internal deletions, splice 
variants and muteins. 
5 Preferably, said vertebrate homologue is a human 

protein, and preferably a mammalian or a mouse 
protein. 

A further aspect of the invention comprises a 
vertebrate homologue comprising an amino acid sequence 

10 as shown in figure 1(f) or the variants thereof or an 
amino acid sequence which differs from the amino acid 
sequences shown in figure 1(f) to a significant extent 
only in one or more conservative amino acid changes. 
In a further aspect of the present invention 

15 there is also provided a nucleic acid molecule, which 
is preferably DNA, and which encodes a vertebrate 
homologue of UNC-53 protein of C. eleaans , or a 
functional equivalent derivative, fragment or 
bioprecursor of said homologue according to the 

20 invention. Preferably, the cDNA comprises a sequence 
of nucleotides encoding an amino acid sequence as 
illustrated in figure 1(f) or the variants thereof or 
an amino acid which differs from the sequences shown 
in these figures to a significant extent only in one 

25 or more conservative amino acid changes. Preferably 
the DNA is cDNA, which cDNA comprises the sequence 
shown in figure 1(e) or the variants indicated therein. 
Also provided by the present invention is a nucleic 
acid sequence capable of hybridising to the nucleic 

30 acid or DNA sequences according to the invention under 
high stringency conditions, which conditions are well 
known to those skilled in the art. 

The cDNA according to the invention may be 
included in an expression vector which may itself be 

35 used to transform or transfect a host cell, which cell 
may be bacterial or eukaryotic in origin including 
such as, for example an animal or plant cell a fungal 
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cell or an insect cell. Thus, advantageously, once 
the cDNA corresponding to the genome of the vertebrate 
homologue of UNC-53 of C. eleaans according to the 
invention is synthesised, using for example, reverse 
5 transcriptase or the like, a range of cells, tissues 
or organisms may be transfected following 
incorporation of the selected cDNA clone into an 
appropriate expression vector. The expression vector 
according to the invention may comprise a promoter of 
10 C. elegans or one of human, mouse or viral origin and 
optionally a sequence encoding a reporter molecule, 
such as, for example, green fluorescent protein. 

The present invention, therefore, also further 

r^-mir\ ~y ~\ c /rs cr n. 4— v~ n <=* <t t-> -5 /— « 1 1 i ^ v <-s-v-«^-v-*-*^tw 

15 comprising a transgene capable of expressing a 

vertebrate homologue of UNC-53 protein of C. eleaans 
according to the invention. The term "transgene 
capable of expressing a vertebrate homologue of UNC-53 
protein of C. eleaans " as used herein means a suitable 

20 nucleic acid sequence which leads to the expression of 
a vertebrate homologue of UNC-53 protein of C. eleaans 
according to the invention having the same function 
and/or activity. The transgene may include, for 
example, genomic nucleic acid isolated from the 

25 appropriate vertebrate or synthetic nucleic acid 
including cDNA. The term "transgenic organisms, 
tissues or cells, as used herein means any suitable 
organism and/or part of an organism, tissue or cell, 
that contains exogenous nucleic acid either stably 

30 integrated in the genome or in an extrachromosomal 
state . 

Preferably the transgenic cell comprises any of, 
a COS cell, HepG2 cell, MCF-7 or N4 neuroblastoma 
cell, a NIH3T3 cell, a colorectal or carcinoma cell or 
35 a human derived cell such as a fibroblast or the like. 
The transgenic organism may be an insect, a non-human 
animal or a plant and preferably C. eleaans or a 
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related nematode. Preferably, the transgene comprises 
the nucleic acid or cDNA sequence encoding the 
vertebrate homologue according to the invention as 
described above. The transgene preferably comprises an 
5 expression vector according to the invention. 

The term "functional fragment" as used herein 
should be taken to mean a fragment of the gene coding 
for the vertebrate homologue of the UNC-53 protein of 
C. eleaans according to the invention. For example, 
10 the gene may comprise deletions or mutations but may 

still encode a functional vertebrate homologue of UNC- 
53 protein. 

Further provided by the present invention is a 
method of producing a mutant vertebrate non-human 

15 organism having a mutation in the wild-type gene 

coding for the vertebrate homologue of UNC-53 protein 
according to the invention, which mutation affects 
cell behaviour or the regulation of cell motility or 
the shape or the direction of cell migration or 

20 microtubule plus end stability or function and 

localisation of protein complexes located thereon, 
which method comprises inducing a mutation in the 
vertebrate homologue of UNC-53 protein in said 
organism. These mutant organisms may be used in a 

25 screen to identify the effects of compounds on these 
cell functions. 

The vertebrate homologue of UNC-53 protein of 
C. eleaans or the cDNA or genomic DNA encoding it or a 
functional equivalent, derivative, fragment or 

30 bioprecursor of said homologue, may advantageously be 
used as a medicament, or in the preparation of a 
medicament to treat or prevent disorders associated 
with inhibition of overexpression of the vertebrate 
homologue of UNC -53 according to the invention. Such 

35 disorders may be alleviated by promoting neuronal 

regeneration, revascularisation or wound healing or 
the treatment of chronic neurodegenerative disorders, 
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psychiatric disorders or acute traumatic injuries or 
fibrotic disease or disease in which physiological 
events requiring the polarity of cells or epithelia 
are abnormally functioning. Accordingly, the 
5 vertebrate homologue according to the invention, 
dominant positive or negative mutants thereof, or 
inhibitors thereof may advantageously be used to 
induce or alleviate contact inhibition in a cell or in 
preventing carcinoma development. Typically, the 
10 above medical conditions may be treated in mammals and 
more preferably humans by either the homologue of UNC- 
53 protein or alternatively by a nucleic acid coding 
for the protein or the protein itself according to the 

_ — ,. - -—~ -—>——-->—'—-»- * s^-i-jr <— >>_/\_.i..t.4_j s_ vj.j.^wiiu^o.^u u± vac 

15 to said UNC-53 vertebrate homologue may be used to 
prevent it's expression. Examples of other nucleic 
acid sequences which may be used include 3' 
untranslated regions of mRNA which could be used to 
prevent transcription of the genomic sequence encoding 

20 for the vertebrate homologue of UNC-53 protein 
according to the invention. 

The vertebrate homologue of UNC-53 protein 
according to the invention may be incorporated into a 
pharmaceutical^ acceptable composition together with 

25 a suitable carrier, diluent or excipient therefor. 
The pharmaceutical composition may advantageously 
comprise, additionally or alternatively, the nucleic 
acid sequence according to the invention as defined 
above . 

30 The induction or inhibition of the expression of 

hu-UNC-53/3 by pharmacological means may 
advantageously be used to induce neuronal 
regeneration, revascularisation or wound healing or be 
involved in the treatment of chronical 

35 neurodegenerative disorders, or acute traumatic 

injuries or fibrotic diseases, or physiological events 
requiring the polarity of cells, or oncology and 
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metastasis of cells, or apoptotic pathways. 

The present invention therefore also provides- for 
a method of determining whether a compound is an 
inhibitor or enhancer of the regulation of cell 
5 behaviour, growth, transformation, cell shape or 
motility or the direction of cell migration, 
microtubule plus end stability or function and 
localisation of protein complexes thereon, which 
method comprises contacting said compound with a 

10 transgenic cell according to the invention and 

screening for a phenotypic change in said cell. The 
method can therefore be used to determine whether the 
compound comprises an inhibitor or an enhancer of the 
signal transduction pathway of said transgenic cell of 

15 which pathway said vertebrate homologue of UNC-53 

protein according to the invention is a component, or 
whether said compound is an inhibitor or an enhancer 
of a parallel or redundant signal transduction pathway 
in said cell. The present invention also provides a 

20 method to determine that the protein in said signal 

transduction pathway is a vertebrate homologue of UNC- 
53 protein of C. eleaans according to the invention. 

Preferably, the phenotypic change to be screened 
comprises a change in cell shape or a change in cell 

25 motility. Where a transgenic cell is used in 

accordance with one embodiment of the method of the 
invention, an N4 neuroblastoma cell may be used and in 
such an embodiment the phenotypic change to be 
screened may be the length of neurite growth, changes 

30 in filopodia outgrowth, changes in ruffling behaviour 
or cell adhesion, any change in microtubule 
cytoskeleton, any change in localisation of proteins 
on plus end regions of microtubules or any change in a 
cell such as apoptosis. In an alternative embodiment 

35 of the method of the invention, the transgenic cell 
may comprise an MCF-7 breast carcinoma cell. 
Typically in such an embodiment the phenotypic change 
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to be screened comprises the extent of phagokinesis or 
filopodia formation. In an alternative embodiment- of 
this aspect of the invention, the transgenic cell may 
comprise an NIH3T3 cell. Typically in such an 
5 embodiment the phenotypic change to be screened 
comprises loss of contact inhibition of foci 
formation. The method according to the invention, may 
also utilise a mutant cell or mutant organism 
according to the invention as described above, where 
10 the mutant cell is capable of growing in tissue 
culture or in vivo and either of which cell or 
organism has a mutation in the wild-type unc-53 gene. 
In accordance with the present invention, a 

M- r>Vi Print" wn i r* rhanno'' maw /"•/-vmr^v-i anu r^Ko^Af ^ r^-x/-s 

X *~ ~" J L "w*-— -*5 f w^in^i. j ^*iCnw l-^f^ 

15 resulting from changes at any suitable point in the 
life cycle of the cell, tissue or organism defined 
above, which change can be attributed to the 
expression of the transgene of the invention such as 
for example, growth, viability, morphology, behaviour, 

20 movement, cell migration or cell process or growth 

cone extension of cells and includes changes in body 
shape, locomotion, chemotaxis, contact inhibition, 
mating behaviour or the like. The phenotypic change 
may preferably be monitored directly by visual 

25 inspection of the cell as a whole or by monitoring the 
F-actin cytoskeleton microtubule network and plus end 
stability of microtubules or proteins thereon or 
alternatively by for example measuring indicators of 
viability including endogenous or transgenically 

30 introduced histochemical markers or other reporter 
genes, such as for example p-galactosidase or green 
fluorescent protein. 

A compound which is identifiable by the method 
according to the invention as described above, as an 

35 enhancer of the processes identified above such as the 
regulation of cell shape or motility or the direction 
of cell migration may be used as a medicament, or 
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alternatively in the preparation of a medicament, for 
promoting neuronal regeneration, revascularisation- or 
wound healing, or for treatment of chronic neuro- 
degenerative diseases or acute traumatic injuries or 
5 fibrotic disease. Examples of promoting neuronal 

regeneration include, for example, peripheral nerve 
regeneration after trauma and spinal cord trauma. 

Where a compound is identified in accordance with 
the method described above as being an inhibitor of 

10 the regulation of cell shape or mobility or the 

direction of cell migration, the compound may be used 
as a medicament, or in the preparation of a 
medicament, for substantially alleviating spread of 
disease inducing cells, such as in spread of 

15 carcinoma, or the like in metastasis or in alleviating 
loss of contact inhibition. Advantageously, any of 
the compounds which may have been identified as an 
inhibitor or an enhancer in accordance with the method 
as described above, may also be included in a 

20 pharmaceutical composition comprising the respective 
compound and a pharmaceutically acceptable carrier, 
diluent or excipient therefor. 

The particular mechanism of action of a compound 
identified as either an inhibitor or an enhancer of 

25 the cell motility shape, growth or direction of cell 
migration or microtubule association or to the plus 
end region thereof is not limiting. Preferably the 
compound acts as an inhibitor or enhancer of a signal 
transduction pathway. The compound may also act on a 

30 parallel pathway or directly on the vertebrate 
homologue of UNC-53 protein of C. eleaans . For 
example, the method of action of the compound may 
include direct interaction with the vertebrate 
homologue of UNC-53 protein, interaction with 

35 processes for regulating phosphorylation or 

dephosphorylation of the vertebrate homologue of UNC- 
53 or with processes regulating activity of an unc-53 
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gene or with processes for post-transcriptional or 
post-translational modification or the like. 

Preferably the compound is identified by the 
method according to the invention as an inhibitor or 
an enhancer, by utilising differences of phenotype of 
the cell, tissue or organism, which are visible to the 
eye. Alternatively indicators of viability including 
endogenous or transgenically introduced histochemical 
markers or a reporter gene may be used. 

According to a further aspect of the invention 
there is also provided a transgenic cell or tissue 
culture which has been constructed to comprise a 
promoter sequence of a gene coding for a vertebrate 
homoloaue of UNC-53 of r. . ^1 

invention operably linked to a nucleic acid sequence 
encoding a reporter molecule. Preferably, the 
reporter sequence encodes for a detectable protein, 
for example one which may be monitored by eye 
inspection such as antibiotic resistance, p- 
galactosidase or a molecule detectable by 
spectropho tome trie, spectrof luorometric, luminescent 
or radioactive assays. 

The present invention also provides a method of 
determining whether a compound is an inhibitor or an 
enhancer of transcription of a gene coding for a 
vertebrate homologue of UNC-53 protein in C. elecrans , 
according to the invention which method comprises the 
steps of: 

(a) contacting said compound with a transgenic 
cell according to the invention as described 
above, 

(b) monitoring the level of said reporter 
molecule and comparing results obtained from this 
monitoring step with a control comprising a 
transgenic cell having the promoter sequence of a 
gene coding for a vertebrate homologue of UNC-53 
protein, or a functional fragment of said 
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homologue and the reporter molecule/ in the 
absence of the compound* 

In one embodiment of the method according to this 
aspect of the invention the reporter molecule may 
5 comprise messenger RNA. 

A compound identified as an enhancer of 
transcription of the gene coding for the vertebrate 
homologue of UNC-53 protein of C. eleaans or a 
functional equivalent, derivative or bioprecursor of 

10 said homologue may also be used as a medicament, or in 
the preparation of a medicament, for promoting 
neuronal regeneration, revascularisation or wound 
healing, or for treatment of chronic neuro- 
degenerative diseases or acute traumatic injuries or 

15 fibrotic disease- Furthermore, such compounds may be 
included in a pharmaceutical composition including a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. Any compounds identified as 
inhibitors of transcription may, advantageously, be 

20 used in alleviating the spread of disease inducing 
cells such as carcinomas or metastasis or loss of 
contact inhibition . 

The present invention also provides a kit for 
determining whether a compound is an enhancer or an 

25 inhibitor of the regulation of cell growth, 

transformation, cell motility or shape or the 
direction of cell migration which .kit comprises at 
least one transgenic or mutant cell or transgenic or 
mutant non-human organism according to the invention 

30 as described above and a plurality of wild-type cells 
or a wild-type organism of the same type, or a cell 
line or tissue culture and means for contacting said 
compound with said cell or organism. 

Also provided by the present invention is a kit 

35 for determining, whether a compound is an inhibitor or 
an enhancer of transcription of a gene coding for a 
vertebrate homologue of UNC-53 protein of C. eleaans 
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according to the invention which kit comprises at 
least one transgenic cell or cells according to the 
invention, means for contacting said compounds with 
said cells and means for monitoring the level of 
5 transcription of said transgenic cell or cells 
according to the invention. 

For the purposes of the present invention, the 
term "gene coding for a vertebrate homologue of UNC-53 
or a functional fragment of said homologue" includes 
.0 the nucleic acid sequence shown in figure 1 or a 

fragment thereof, including the differentially spliced 
isoforms and transcriptional starts of the nucleic 
acid sequence and which sequence encodes a vertebrate 
homologue of UNC-53 protein or a functional 
5 equivalent, derivative, fragment or bioprecursor of 
the protein. 

The present invention also provides methods of 
identifying genes of vertebrates or fragments of said 
genes, which encode proteins which are active in the 
0 signal transduction pathway of which the vertebrate 

homologue of UNC-53 according to the present invention 
is a component. A preferred method comprises 
hybridizing to an appropriate cDNA library a 
nucleotide sequence, as defined herein, or a fragment 
5 thereof under appropriate conditions of stringency in 
order to identify genes having statistically 
significant homology with the cDNA clones of any one 
of the cDNA sequences according to the invention 
described above. 
0 Furthermore, there is also provided by the 

present invention a method of identifying a protein 
.which is active in the signal transduction pathway of 
a cell of which a vertebrate homologue of UNC-53 
protein of C. eleaans according to the invention is a 
5 component. According to this aspect of the invention, 
the method comprises; 

(a) contacting an extract of said cell with an 
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antibody to the vertebrate homologue of UNC-53 
protein or a functional equivalent, fragment or 
bioprecursor of said protein, 

(b) identifying the antibody/vertebrate 
5 homologue of UNC-53 complex, and 

(c) analysing the complex to identify any 
protein bound to the vertebrate homologue of 
UNC-53 protein other than the antibody. 

The vertebrate homologue of UNC-53 protein, 

10 therefore may bind regions of other proteins involved 
in the signal transduction pathway. It is also 
possible to sequentially identify a whole range of 
proteins involved in the signal transduction pathway. 
Antibodies to the vertebrate homologue of UNC-53 

15 protein may be produced according to known techniques 
as would be known to those skilled in the art. For 
example, polyclonal antibodies may be prepared by 
inoculating a host animal, such as a mouse, with a 
protein or epitope of a protein according to the 

2 0 invention and recovering immune serum. 

This aspect of the invention, further comprises a 
method of identifying a further protein or proteins 
which are active in the signal transduction pathway of 
a cell of which the vertebrate homologue of UNC-53 is 

25 a component which method comprises: 

(a) forming an antibody to the first identified 
protein bound to the vertebrate homologue of 
UNC-53 protein in the method as described above, 

(b) contacting a cell extract with the antibody, 
30 (c) identifying any antibody/protein complex, 

(d) analysing the complex to identify any 
further protein bound to the first protein other 
than the antibody, and 

(e) optionally repeating steps (a) to (d) to 
35 identify further proteins in the pathway. 

According to this aspect of the present 
invention, the antibody starts the process by binding 
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to the vertebrate homologue of UNC-53 protein 
according to the invention in the signal transduction 
or oncogenic pathways. Any other proteins found 
complexed to the bound antibody or UNC-53 protein can 
then be used to identify further interacting proteins 
involved in the pathway. 

It may also be possible to identify proteins 
involved in the signal transduction pathway of a cell 
of which the vertebrate homologue of UNC-53 is a 
component by using a vertebrate homologue of UNC-53 < 
protein of C. eleaans . According to this aspect of 
the invention the method comprises: 

(a) contacting an extract of the cell with the 

C. eleaans or a functional equivalent, 
fragment or bioprecursor of said homologue, 

(b) identifying the vertebrate homologue of UNC- 
53 protein/protein complex formed and 

(c) analysing the complex to identify any 
protein bound to the vertebrate homologue of 
UNC-53 protein other than the same 
vertebrate homologue of UNC-53 protein. 

This method can also advantageously be used to 
identify further proteins in a signal transduction 
pathway of a cell by contacting an extract of the cell 
used as described above, with any protein identified 
from step (c) above not being a vertebrate homologue 
of UNC-53 protein and repeating steps (b) and (c) . 

Other methods which may be used for identifying 
proteins in a signal transduction pathway of a cell 
may comprise for example a western blot overlay method 
which method is well known to those skilled in the 
art. Cell extracts are run on gels to separate out 
protein and subsequently blotted onto a nylon 
membrane. These membranes may then be incubated, for 
example in a medium containing vertebrate homologue of 
UNC-53 having a label attached thereto such as a 
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biotin or radiolabel and any protein conjugates 
visualised with for example a streptavidin or alkaline 
phosphatase conjugated antibody. 

The present invention also advantageously 
5 provides a process for the preparation of binding 
antibodies which recognise proteins or fragments 
thereof involved in the rate and direction of cell 
migration or the control of cell growth or shape, for 
the above methods . 

10 The monoclonal antibody for binding to the 

appropriate vertebrate homologue of UNC-53 (or its 
functional equivalent) may be prepared by known 
techniques as described by Kohler R. and Milstein C, 
(1975) Nature 256, 495 to 497. 

15 Another method which may be used to identify 

proteins involved in the signal transduction pathway 
of a cell of which a vertebrate homologue of an UNC-53 
protein of C. eleaans according to the invention or is 
a component, involves investigating protein-protein 

20 interactions using the two-hybrid vector method. This 
method, which is well known to those skilled in the 
art was first developed in yeast by Chien et al 
(1991) . This technique is based on functional 
reconstruction in vivo of a transcription factor which 

25 activates a reporter gene. More particularly the 

technique comprises providing an appropriate host cell 
with a DNA construct comprising a reporter gene under 
the control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 

30 domain, expressing in the host cell a first hybrid DNA 
sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 

35 host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
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binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 
investigated with a protein according to the invention 
5 by detecting for the presence of any reporter gene 

product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

An example of such a technique utilises the GAL4 
protein in yeast. GAL4 is a transcriptional activator 

10 of galactose metabolism in yeast and has a separate 
domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 

15 residues encoding the DNA binding domain of GAL4 . 

These binding domain residues may be fused to a known 
protein encoding sequence, such as for example a 
sequence coding for the vertebrate homologue of 
UNC-53. The other vector comprises the residues 

20 encoding the protein binding domain of GAL 4 . These 
residues are fused to residues encoding a test 
protein, preferably from the signal transduction 
pathway of the vertebrate in question. Any interaction 
between the vertebrate homologue of UNC-53 protein and 

25 the protein to be tested leads to transcriptional 
activation of a reporter molecule in a GAL -4 
transcription deficient yeast cell into which the 
vectors have been transformed. Preferably, a reporter 
molecule such as 6-galactosidase is activated upon 

30 restoration of transcription of the yeast galactose 
metabolism genes. This method enables any 
interactions between proteins involved in the signal 
transduction pathway or a parallel or redundant 
pathway to be investigated. 

35 Any proteins identified in the signal 

transduction pathway of the cell, which may be for 
example a mammalian cell, may also be included in a 
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pharmaceutical composition together with a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor . 

The present invention also provides a process for 
5 producing a vertebrate homologue of an UNC-53 protein 
of C. eleaans according to the invention which process 
comprises culturing the cells transformed or 
transfected with a cDNA expression vector having any 
of the cDNA sequences according to the invention as 

10 described above, and recovering the expressed protein 
homologue. The cell may advantageously be a 
bacterial, animal, insect or plant cell. 

A particularly preferred process for producing 
said vertebrate homologue of UNC-53 protein uses 

15 insect cells. Accordingly, the invention provides a 

process for producing a vertebrate homologue of UNC-53 
protein of C. eleaans according to the invention which 
process comprises culturing an insect cell transformed 
or transfected with a recombinant Baculovirus vector, 

20 said vector comprising a nucleotide sequence encoding 
said vertebrate homologue of UNC-53 protein according 
to the invention downstream of the Baculovirus 
polyhedrin promoter and recovering the expressed 
protein. Advantageously, this method produces large 

25 amounts of protein for recovery. The insect cell may 
be from for example Soodootera f ruaioerda or 
Drosoohila Melanoaester . 

In accordance with the present invention, a 
defined nucleic acid sequence includes not only the 

30 identical nucleic acid but also any minor base 

variations from the natural nucleic acid sequence 
including in particular, substitutions in bases which 
result in a synonymous codon (a different codon 
specifying the same amino acid) , due to the degenerate 

35 code in conservative amino acid substitution. The 
term "nucleic acid sequence" also includes the 
complimentary sequence to any single stranded sequence 
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given which includes the definition above regarding 
base variations. 

Furthermore, a defined protein, polypeptide or 
amino acid sequence according to the invention, 
5 includes not only the identical amino acid sequence 

but also minor amino acid variations from the natural 
amino acid sequence including conservative amino acid 
replacements (a replacement by an amino acid that is 
related in its side chains) . Also included are amino 

10 acid sequences which vary from the natural amino acid 
but result in a polypeptide which is immunologically 
identical or similar to the polypeptide encoded by the 
naturally occurring sequence. Such polypeptides may 
be encoded by a corresponding nucleic acid sequence. 

15 A further aspect of the invention provides a 

nucleic acid sequence of at least 15 nucleotides of a 
nucleic acid according to the invention and preferably 
from 15 to 50 nucleotides. 

These sequences may, advantageously be used as 

20 probes or primers to initiate replication or the like. 
Such nucleic acid sequences may be produced according 
to techniques well known in the art, such as by 
recombinant or synthetic means. They may also be used 
in diagnostic kits or the like for detecting for the 

25 presence of a nucleic acid according to the invention. 
These test generally comprise contacting the probe 
with a sample under hybridising conditions and 
detecting for the presence of any duplex formation 
between the probe and any nucleic acid in the sample. 

30 Nucleic acid sequences according to the invention may 
also be produced using recombinant or synthetic means 
such as described in Sambrook et al (Molecular 
Cloning: A Laboratory Manual, 1989) . Advantageously, 
human allelic variants or polymorphisms of the DNA 

35 according to the invention may be identified by, for 
example, probing DNA from a range of individuals for 
example from different populations. Furthermore, 
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nucleic acids and probes according to the invention 
may be used to sequence genomic DNA from patients 
using techniques well known in the art, such as the 
Sanger Dideoxy chain termination method, which may 
5 advantageously ascertain any predisposition of a 
patient to certain disorders. 

A method of detecting whether a compound is an 
inhibitor or an enhancer or expression of a vertebrate 
homologue of UNC-53 of C. eleaans , according to the 

10 invention is also provided which method comprises 

contacting a cell expressing said homologue with said 
compound and monitoring for a phenotypic change 
compared to a control cell which has not been 
contacted with said compound. 

15 Preferably the cell is a transgenic cell as 

described above- Alternatively the cell may have 
undergone loss of contact inhibition. 

The present method also provides for determining 
whether said compound is an inhibitor or expression of 

20 said vertebrate homologue. In one embodiment the 
compound to be tested comprises a nucleic acid. 

Preferably said nucleic acid sequence comprises 
an antisense DNA sequence or a mRNA sequence. 

Preferably said mRNA sequence comprises 3' 

25 untranslated regions of mRNA encoding for said 
vertebrate homologue. 

Alternatively, the compound to be tested may be a 
protein. Preferably, said protein comprises a protein 
having an amino acid sequence potentially suitable for 

30 inhibiting function of said vertebrate homologue and 
preferably comprises a protein identified by the 
methods as described herein. 

The present invention also provides a 
pharmaceutical composition comprising a compound, for 

35 example an antisense nucleic acid identified according 
to the above described method together with a 
pharmaceutical^ acceptable carrier, diluent or 
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excipient therefor. 

A nucleic acid sequence or protein identified 
according to this aspect of the invention may be used 
as a medicament, or in the preparation of a 
5 medicament, for treating loss of contact inhibition of 
cancer which is mediated by vertebrate homologue of 
UNC-53 protein or a functional equivalent, fragment, 
derivative or bioprecursor of said homologue. 

Further provided by the invention is a nucleic 
10 acid as defined above for use in preparation of a 

medicament for inhibiting expression of a gene coding 
for a vertebrate homologue of UNC-53 protein of 
C. eleaans . 

Further provided by the invention is an assay for 

15 detecting expression of the vertebrate homologue of 
UNC-53 protein of C. eleaans in a vertebrate cell 
which assay comprises contacting a cell or an extract 
thereof with an antibody to said vertebrate homologue, 
which antibody is fused to a reporter molecule, 

20 removing any unbound antibody and monitoring for the 
presence of said reporter molecule. 

Preferably the reporter molecule is an antibody 
conjugated to for example a fluorophore such as 
fluorescein or alternatively to an enzyme such as 

25 strepavidin. 

There is also provided a method for detecting for 
expression of a gene coding for the vertebrate 
homologue of UNC-53 protein of the invention which 
method comprises contacting a probe specific for a 

30 nucleic acid of protein sequence coding for or 

corresponding to said vertebrate homologue according 
to the invention with a cell extract, which probe is 
linked to a reporter and analysing for the presence of 
said reporter. 

35 Preferably the probe is a complementary sequence 

to a region of mRNA transcribed from said gene 
encoding said vertebrate homologue of UNC-53 protein 



BNSDOCID: <WO 9963080A1_I_> 



WO 99/63080 




PCT/EP99/03848 



- 21 - 

according to the invention. 

Preferably the complimentary sequence is a 3' or 
5' untranslated region of said mRNA. Preferably said 
reporter may be a dig label, a fluorophore, a hapten 
5 or a radiolabel. 

Alternatively said probe may comprise an antibody 
specific for said vertebrate homologue of said UNC-53 
protein. 

Preferably the reporter is an antibody conjugated 

10 to for example a fluorophore such as fluorescein or 
alternatively an enzyme such as streptavidin. 

As described above, UNC-53 protein of C.elegans 
has been found to localise to microtubule and 
particularly to microtubule ( + ) ends. Therefore, 

15 there is provided by a further aspect of the present 
invention a method of determining whether a compound 
is an inhibitor or an enhancer of association of the 
UNC-53 homologue of the invention to microtubules or 
plus end regions thereof, which method comprises (a) 

20 contacting said compound with a transgenic cell, 
tissue or organism expressing said vertebrate 
homologue and which protein is operably linked to a 
reporter molecule (b) screening for the localisation 
of said reporter molecule as compared to a cell 

25 according to step (a) which has not been contacted 
with said compound. 

A compound identifiable by the above method also 
forms part of the present invention. Such a compound 
identified as an inhibitor of localisation or 

30 association of said vertebrate homologue with 

microtubules or the plus end region thereof may be 
used in alleviating the spread of disease inducing 
cells or metastasis or loss of contact inhibition. 
Further a compound identified as an enhancer of 

35 association of said vertebrate homologue with 

microtubules or the plus end region thereof may be 
used in for example promoting neuronal regeneration, 
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revascularisation or wound healing, or for treating 
chronic neurodegenerative diseases or acute traumatic 
injuries or fibrotic disease. These compounds may 
then be included in a pharmaceutical composition, 
5 together with a pharmaceutical^ acceptable carrier, 
diluent or excipient therefor . 

Also provided by the present invention is a kit 
for determining whether a compound is an inhibitor or 
an enhancer of association of the vertebrate homologue 

10 thereof according to the invention with microtubules 
or the plus end regions thereof, which kit comprises 
at least one transgenic cell expressing said UNC-53 
vertebrate protein homologue and a reporter molecule 
or a host or transgenic ceil according to the 

15 invention and at least one cell of the same cell type 
for use as a control and means for contacting said 
compound with one of said at least one transgenic 
cells. Compounds identified as inhibitors or 
enhancers or microtubule association described above 

20 may advantageously be included in a composition and 
linked to said vertebrate homologue according to the 
invention to target the compounds to the microtubules 
or the plus end regions thereof. Such a composition 
may also comprise, for example, a suitable 

25 transfecting or transformation agent. 

According to a further aspect of the invention 
there is provided a method of targeting a protein to a 
cell microtubule or the plus end region thereof, which 
method comprises introducing into a host cell, tissue 

30 or organism a transgene comprising a sequence capable 
of expressing said UNC-53 vertebrate homologue 
according to the invention, which sequence is operably 
linked to a sequence encoding said protein to be 
targeted such that a chimeric protein is expressed and 

35 which results in targeting of said protein to said 
microtubule or a plus end region thereof. An even 
further aspect of the invention comprises a method of 
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identifying a molecule which covalently modifies UNC- 
said vertebrate homologue according to the invention, 
which method comprises a) contacting either an extract 
from a cell or cells expressing said vertebrate 
5 homologue or a mixture of enzymes comprising candidate 
UNC-53 modifying enzymes in the presence of an 
indicator of covalent modification of a protein, b) 
identifying any covalently modified UNC-53 protein 
from step a) and c) identifying said molecule involved 
10 in said modification step. Such an indicator may be 
32 P. 

Further provided by the invention is a method of 
identifying a compound which alleviates or enhances 
the toxicity of said UNC-53 vertebrate homologue 

15 thereof according to the invention, or which 

alleviates or enhances apoptosis. The method of the 
former comprises contacting said compound with a 
transgenic cell, tissue or organism according to the 
invention and monitoring for the presence of said 

20 reporter molecule adjacent said microtubules or the 

plus end region thereof* In the case of apoptosis the 
method comprises monitoring the effect of the compound 
on cell death. 

The invention may be more clearly understood from 

25 the following examples which are purely exemplary, 

with reference to the accompanying drawings wherein, 

Figure 1(a) is an illustration of the nucleotide 
sequence encoding the first human homologue of UNC-53 
designated Hs-UNC-53/1 and further variants thereof. 

30 Figure 1(b) is an illustration of the amino acid 

sequence of hs-UNC-53/1 encoded by the sequences in 
Figure 1 (a) . 

Figure 1(c) is an illustration of the nucleotide 
sequence encoding the second human homologue of UNC-53 
35 protein of C. eleaans designated Hs-UNC-53/2 and 
further variants thereof. 

Figure 1(d) is an illustration of the amino acid 
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sequences of Hs-UNC-53/2 encoded by the sequences in 
Figure 1 (c) . 

Figure 1(e) is an illustration of a nucleotide 
sequence encoding the third human homologue of UNC-53 
5 protein according to the invention designated Hs-UNC- 
53/3, and variants thereof. 

Figure 1(f) is an illustration of the amino acid 
sequences of the Hs-UNC-53/3 encoded by the sequences 
of Figure 1(e). 
10 Figure 1(g) is an illustration of the nucleotide 

sequence of a genomic DNA fragment that contains a 
putative 5' exon of Hs-unc-53/1. 

Figure 1 (h) is an illustration of the nucleotide 
sequence AB023 155 encoding the protein KIAA0938, a 
15 transcript comprising the 3' half of Hs-unc-53/3. 

Figure l(i) is an overview of the C. elegans and 
human UNC-53 proteins as cloned. The 5' truncated 
variants and a number of the known splice variants 
have been indicated. 
20 Figure 2 is an alignment of the amino acid 

sequences of Ce-UNC-53, Hs-UNC-53/1, Hs-UNC-53/2 and 
Hs-UNC-53/3. 

Figure 3 is an alignment of the C. elegans unc-53 
and the predicted amino acid sequence of C. briggsiae 
25 unc-53. 

Figure 4 is a list of ProSite signatures for 
vertebrate UNC-53s based on the sequence alignment. 

Figure 5a is an illustration of expression of the 
three human UNC-53s as studied by Northern blotting. 
30 Figure 5(b) is an illustration of differential 

expression of Hs-unc-53/3 in different brain parts. 

Figure 6(a) is an illustration of differential 
splice variant expression of Hs-unc-53/1 using RT-PCR. 

Figure 6(b) is an illustration of differential 
35 splice expression of Hs-unc-53/2 using RT-PCR. 

Figure 6(c) is an illustration of differential 
expression of Hs-unc-53/3 using RT-PCR. 
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Figure 6(d) is a sequence confirmation of 
AB023155 expression in cells other than brain using 
RT-PCR. 

Figure 7(a) is an illustration of the cloning of 
5 Hs-unc-53/3. 

Figure 7(b) is a plasmid map and the nucleotide 
sequence of the pGI3303 expression vector ( C-terminal 
Hs-unc-53/3 fragment in fusion with GFP) . 

Figure 7(c) is an illustration of the amino acid 
10 sequence of GFP: C-terminal Hs-unc-53/3 fragment 
(insert of pGI3303) . 

Figure 7(d) is a plasmid map and the nucleotide 
sequence of the pGI3305 expression vector (full length 
Hs-unc-53/3 in fusion with GFP) . 
15 Figure 7(e) is an illustration of the amino acid 

sequence of GFP : Hs-unc-53/3 (insert of pGI3305) . 

Figue 8 is an illustration of the filipodia and 
lamellipodia outgrowth of N4 mouse neuroblastoma cells 
transfected with pGI3303 (F-actin cytoskeleton 
20 reorganisation) 

Figure 9 is an illustration of the co- 
localisation of the GFP:Hs-unc-53/3 fusion protein 
with microtubules in N4 mouse neuroblastoma cells 
transfected with pGI3305. 
25 Figure 11a is an illustration of the homology 

domains between Hs-unc-53/3 and a gene encoded 
(partially) by the Drosophilia melanogaster BAC clone 
BACR48M05 (AC005719) . Results of a TBLASTN search on 
the non-redundant database with Hs-unc-53/3 as query. 
30 Figure lib is an illustration of an ORF encoded 

by the Drosophila melanogaster BAC clone BACR4 8M05 
(AC005719) as predicted by the computer program Fgene. 

Figure 11c is an illustration of a "BLAST 2 
sequences'' search result with Hs-unc-53/3 as query and 
35 the Fgene predicted UNC53 homology ORF of D. 
melanogaster BAC clone BACR4 8M05 . 

Figure 12 is an illustration of a zebra fish EST 
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encoding Dr-unc-53/2. 

Figure 13 Genemap98 results for Hs-unc-53/2. 

Figure 14 is a schematical drawing of the 
sequence of the exon containing the putative 
5 alternative start codon of human Hs-unc-53/1. 

Figure 15 is an illustration of the nucleotide 
sequence of pGI3150 and the amino acid sequence of the 
eGFP fusion with a C-terminal fragment of Hs-Unc-53/1. 

Figure 16 is an alignment of EST clone yk480b6 
10 and Ce-unc-53 demonstrating a novel splice variant of 
Ce-unc-53 . 

Figure 17 is a graphical display of the effect of 
Hs-unc-53/3 GFP chimera transient transfection on the 
form factor of N4 ceils. 

15 

DEPOSITED MATERIAL 

Plasmids pG13303 and pG13305 were deposited under 
accession numbers LMBP3936 and LMBP3937 respectively 
20 on 28 May 1999 at the Belgian Coordinated Collections 
of Microorganisms (BCCM) at Laboratorium voor 
Moleculaire Biologie - Plasmidencollective (LMBP) B- 
9000 Ghent, Belgium, in accordance with the provisions 
of the Budapest Treaty of April 28 1977, 

25 

Hs-UNC-53/3 is a bona fide UNC-53 (fig. 1; 2; 3) 

Blastn and Tblastn EST-database mining using the 
sequence of the already known animal UNC-53s led to 

30 the identification of 3 ESTs suggestive of novel unc- 
53s (see experimental procedures). By 3'- and 5'- 
RACE extension using suitable libraries, it was shown 
that these ESTs identified a novel unc-53 designated 
Hs-unc-53/3 (Fig. 1 e; f) . The publication of the 

35 sequence AB023155 (Nagase et al. 1999, DNA Res. 6:63- 
70) independently confirmed the correctness of the 3'- 
end of Hs-unc-53/3 as well as the existence of one new 
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intron that forms the 5' -end of AB023155. Alignments 
of the C. elegans and 3 human UNC-53 sequences (fig. 
2) clearly illustrates that the third human homologue 
of C. elegans UNC-53 protein is a bona fide UNC-53 
5 with highest similarity to Hs-UNC-53/2 and in 

decreasing order to Hs-UNC-53/1 and (C. eleaans UNC- 
53) Ce-UNC-53. 

Many of the domains of Hs-UNC-53/3 show highest 
similarity to functional domains of other animal UNC- 

10 53s (fig. 2) . This critically suggests that Hu-UNC- 
53/3 most likely has the key functionalities observed 
for Ce-UNC-53 in a variety of assays including F-actin 
binding, F-actin reorganisation in cell culture, 
microtubule and microtubule (+)-end binding in 

15 cultured cells, binding of SH3-domain adapters like 

SEM-5/GRB-2 or other types of binders of proline rich 
alpha-helices. These results indicate that like Ce- 
UNC-53, Hs-UNC-53/1, Hs-UNC-53/2, or Hs-UNC-53/3 can 
be used in a range of biochemical, cellular and animal 

20 assays aimed at discovering tissue- or disease- 
specific modulators of Hs-unc-53 functioning in 
diagnostic assays. 

Further extension of the Unc-53 family (Fig. 11, 
25 12) 

Database searches with the three human UNC-53 
protein sequences revealed several expressed sequence 
tags (ESTs) and genomic DNA sequences (BACs) that show 
30 significant similarlity to human UNC-53. 

C. briggsiae 

The C. elegans genome consortium sequenced the 
35 locus of the C. briggsiae unc-53 homologous gene. 
Through gene prediction programs and 

the cDNA sequence of the C. elegans unc-53, prediction 
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can be made for the C. briggsiae protein sequence. 
Alignment of the derived C. briggsiae 
amino acid sequence with the C. elegans amino acid 
sequence in figure 3 demonstrates the strong homology 
5 of both proteins. 

D . melanogaster 

BAC clone BACR48M05 (AC005719) clearly contains 3 

10 different exons with high homology to Hs-unc-53/3 
(Figure 11) . Using the gene structure prediction 
program Fgene [Solovyev et al., 1995, in: Proceedings 
of the Third International Conference on Intelligent 
Systems for Molecular Biology (eds. Rawling et ai . , 

15 Cambridge, England, AAAI Press) ; Solovyev and 

Lawrence, 1993, in: Abstracts of the 4th annual keck 
symposium. Pittsburgh, 47) it was possible to predict 
an ORF encoded by BAC clone BACR4 8M05 that shows 
homology to Hs-unc-53/3 (Figure lib) . However, every 

20 Drosophila cDNA partially or entirely encoded by BAC 
clone BACR4 8M05 and which contains one or more 
sequence blocks as indicated in figure 11a should be 
considered as a family member of the UNC-53 family. A 
"BLAST 2 SEQUENCE" search indicates that the sequence 

25 situated between the three homology blocks that are 
indicated in figure 11a is less conserved between 
human and Drosophila (Figure 11c) . The predicted ORF 
of the Drosophila melanogaster UNC53 gene can be used 
to identify new members of the family. The zebrafish 

30 EST fc21d06 (AI658309) shows an identity of 84% and a 
homology of 92% to Hs-UNC-53/2. It clearly can be 
considered as a part of the zebrafish homologue of Hs- 
UNC-53/2 (Figure 12) . Finally, a whole series of 
human ESTs have been placed in public domain 

35 databases- To our knowledge, no one has been able to 
place these ESTs into contigs that describe a true Hs- 
unc-53 to a level presented in this specification. 
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The presently available unc-53 sequences - expressed 
or genomic - further underscore that the unc-53 gene 
family is a true animal gene family in helminths, 
vertebrates and arthropods, three major classes of the 
5 animal kingdom. 



Refined UNC-53 family description based on 
alignment (fig. 4) . 

10 The alignment of the three human and the 

eleaans UNC-53 sequences enables the more refined 
definition of conserved regions in UNC-53s. In figure 
4 there are compiled a number of proSite signatures 
for either the four animal or the three human UNC-53s. 

15 

Differential expression of Hu-UNC-53/3 by 
Northern blot (fig. 5) . 

To determine in which cells and tissues the 
20 vertebrate UNC-53s play a role, a northern blot 

analysis has been performed. As indicated in the 
experimental section, relevant probes were amplified 
and used to visualise in which normal human tissues 
and in which cancer cell lines the three human UNC-53s 
25 were expressed. 

1, A cancer cell line RNA blots probed with Hs- 
Unc53/1 . 

A Northern blot of poly-A+RNA from several 
30 cancer cell lines (Melanoma G361, Lung Cancer A549, 
Colorectal Adenocarcinoma SW480, Burkitt Lymphoma 
DRajii, Leukemia Molt4, Lymphoblastic Leukemia K562, 
HeLa S3 and Promyelocytic Leukemia HL60) was probed 
using the whole insert of pHH3b . No or weak 
35 expression was detected in the Burkitt Lymphoma 
DRajii, the Leukemia Molt4 and the Promyelocytic 
Leukemia HL60 cell lines. Five different transcripts 
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are detected in the remaining cancer cell lines: 
transcripts 1 and 2 are larger than 9.5kb, transcripts 
3 and 4 are 6 to 7 kb and the fifth transcript is 
around 6 kb. Transcripts 1 and 2 are present in all 
5 expressing cell lines but at different levels. 

Transcripts 3 and 4 are restricted to Melanoma G361, 
Lung Cancer A54 9 (weak) and Colorectal Adenocarcinoma 
SW4 8 0 and are the predominant transcripts in Melanoma 
G361 and Colorectal Adenocarcinoma SW480. Transcript 
10 5 is restricted to Lymphoblastic Leukemia K562 (weak) 
and (predominant) in HeLa S3 and is predominant in 
HeLa S3. 

m v^chi^cl. v^c-l-l xj.nt;s rvLN-rt. .uj-ljus pi uutiu wj_l.ii ns~ 

15 Unc53/2. 

A similar set of cancer cell line Northern 
blots were probed with a 652bp fragment of EST46037 
amplified by using the primers 5'- 

aggagatgaagctgacagatatcc and 5' -aaacaccagtgagtcc . Hs- 
20 Unc53/2 is expressed in Melanoma G361, Colorectal 

Adenocarcinoma SW4 80, Lymphoblastic Leukemia K562 and 
HeLa S3. No expression was detected in Lung Cancer 
A549, Burkitt Lymphoma DRajii, Leukemia Molt4 and 
promyelocytic leukemia HL60. Interestingly only 2 
25 transcript sizes were detected of around 7 kb 

expressed in Lymphoblastic Leukemia K562 and HeLa S3 
and a transcript of >9.5 kb in Melanoma G3 61 and 
Colorectal Adenocarcinoma SW4 80 and weakly in HeLa53. 
Noteworthy is the very high expression in melanoma 
30 G361. 

3. Normal Human tissue probed with Hs-Unc53/1. 
A Northern blot of poly-A+RNA from normal 
human tissue was probed using the whole insert of 
35 phage HH3b. Expression levels are low in all tissues 
with the highest level in heart and placenta, several 
fold lower levels in brain and testis, even lower 
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levels in skeletal muscle/ pancreas, thymus, colon, 
small intestine, ovary and prostate. Expression in 
peripheral blood leukocyte, lung, liver, kidney, 
spleen is barely detectable. 

5 

4. Normal Human tissue probed with Hs-UNC53/2. 
A similar set of blots were probed with a 
652bp fragment of EST46037 amplified by using the 
primers 5' aggagatgaagctgacagatatcc and 5'- 

10 aaacaccagtgagtcc. Expression levels are low in all 

tissues with the highest level in kidney, placenta and 
pancreas, lower levels in heart and lung. Expression 
is barely detectable or undetectable in skeletal 
muscle, spleen, thymus, prostate, testis, ovary, small 

15 intestine, colon peripheral blood leucocyte, stomach, 
thyroid, spinal cord, trachea, adrenal gland and bone 
marrow. Also Hs-unc-53/2 appears to be expressed as 
different transcripts (figure 5a) . 

The hs-UNC53/l and hs-UNC-53/2 homologues are 

20 clearly highly regulated genes, showing a strong 
tissue specificity and, probably, additional 
mechanisms of regulation (ie differential splicing of 
different promoters) . The different proteins derived 
from RNA' s identified by probe hhl5 presumably share 

25 the carboxyterminal nucleotide binding domain. 

Ce-UNC-53 was shown to be a complex genetic locus and 
complex transcription unit. The different transcripts 
are thought to be a mechanism to assure the necessary 
specificity and functional diversity of this signal 

30 transduction pathway, with respect to different 

signals and receptors, different tissues and different 
directions of migration. The occurrence of a new 
transcript or the observed changes in expression 
levels in the cancer cell line blot suggests a role 

35 for hs-UNC-53/3 in the establishment or maintenance of 
the transformed state of those cells. 
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Expression pattern of hs-UNC-53/3. 

A northern blot of poly-A+RNA from several cancer 
lines was probed with unique fragments of the three 
5 genes from the Hs-unc-53 family. Hs-unc-53/3 has a 
high expression level in lung carcinoma line A549, 
where only a moderate expression of hs-unc-53/1 has 
been detected. Furthermore/ moderate expression of 
Hs-unc-53/3 was also observed in melanoma line G361, 
10 where previously, a high expression of hs-UNC-53/1 and 
hs-UNC-52/2 has been observed. This indicated the 
involvement of hs-unc53/3 in at least two cancer 
lines . 

In normal human tissues, the expression of hs- 

15 unc-53/3 shows a clearly new and previously unobserved 
expression pattern. This difference of expression of 
hs-unc-53/3 in relation to its homologues hs-unc53/l 
and hs-unc53/2 is important for the allocation of 
functionality to hs-unc-53/3. 

20 Hs-unc-53/3 is highly expressed in brain, as 

shown on the Northern blots (figure 5a) . In figure 5b 
it can be seen that Hs-unc-53/3 also is differentially 
expressed in different parts of the brain. Its 
homologues are not or weakly expressed in brain. This 

25 gives an indication that its function in 

directionality of cell migration and growth cone 
steering will be in relation to specific regions or 
cells of the brain. It is deduced that Hs-unc-53/3 
will be an important signal transducer or signal 

30 adapter linking signals to neuronal outgrowth, axon 
guidance, and formation and maintenance of synaptic 
connections. It seems that the function of Hs-unc- 
53/3 will be associated with neuron-neuron 
interactions, neuronal outgrowth, neuron muscle 

35 interactions, and post-synaptic signal transduction. 
Furthermore, Hs-unc-53/3 may be involved in the 
development of cancer of neuronal origin, like 
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neuroblastomas, or the development of tumours will 
have their developmental origin in the brain as some 
eyes diseases like retinoblastomas. 

The significance of the high expression of Hs- 
5 unc-53/3 in brain tissue can be associated with the 

high levels of expression which has also been observed 
in the spinal cord, containing neuronal tissue. Here, 
neuronal (axon) outgrowth and neuron-neuron 
connections are of importance. Development of 

10 pharmacological tools acting on this pathway may lead 
to treatments of diseases involved in the growth and 
movement of neuronal cells, and the regeneration of 
neuronal connectivity after trauma, or the inhibition 
of neuronal cancers such as neuroblastomas. Due to 

15 its specific expression, inhibitors and/or enhancers 
specific for Hs-unc-53/3 will have an advantage as a 
pharmaceutical compound over more general compounds 
acting on the Hs-unc-53 family of genes and proteins. 
A second tissue where hs-UNC-53/3 is highly 

20 expressed and where (its) other human homologues are 
not expressed is the spleen. Hs-UNC-53/3 could 
therefore function as part of the signal transductions 
pathway involved in the maturation of leukocytes. 
Malfunction of this pathway may lead to incorrect 

25 maturation of the leukocytes and the development of 
autoimmune diseases such as rheumatoid arthritis and 
sclerosis. Next to the signalling function in the 
recognition of the leukocytes, Hs-UNC-53/3 may also 
play an important role in the induction and/or 

30 signalling pathway of the mechanism underlying 

apoptosis of leukocytes in the spleen. Pharmaceutical 
methods involving the hs-UNC-53/3 pathway, which may, 
for example, result in an inhibition and/or 
enhancement of its expression may lead to treatment of 

35 these disorders. Furthermore, hs-UNC-52/2 may have an 
advantage, as an inhibitor or enhancer specific for 
hu-unc53/3 which will act in a more specific manner. 



BNSDOCID: <WO 9963080A1_I_> 



WO 99/63080 




PCT/EP99/03848 



- 34 - 



The Hu-UNC-53/3 protein is also highly expressed 
in the ovary, where the two other human homologues are 
also expressed. Finally moderate to low expression of 
hs-unc53/3 is observed in heart, placenta, testis, 

5 stomach and adrenal gland. 

Although the predominant transcripts of Hs-unc- 
53/3 are > 9 kb, often a smear occurs that ends at 
with somewhat higher intensity at 5.5 - 6.5 kB. This 
short transcript may correspond to AB023155. 

0 The Hs-unc53/3 gene is a highly regulated gene, 

showing strong tissue specificity and additional 
mechanisms of regulation which have not previously 
been identified in any of its known homologues. These 
findings may thus lead to the development of more 

5 specific inhibitors or enhancers of hs-UNC-35/3 and or 
of the Hs-UNC-53/3 pathway. The Northern blot studies 
indicate that the three human unc-53s are complex 
transcriptional units with highly regulated tissue 
specificity and that transcripts of different lengths 

0 exist. 

Splice variants of human unc-53s 

Whilst cloning Hs-unc-53/3, it became apparent 
5 that at least three expression variants of Hs-unc-53/3 
- most probably alternative splices - exist (fig. le, 
f ; lowercase regions) . Targeted efforts for the two 
other human UNC-53s demonstrated that the other human 
UNC-53s contained variants (fig. la, c and e regions) . 
0 Splice variants as observed to date appear to be 

concentrated in specific regions. A first one 
(starting at position 1252 in fig. 2) - in which the 
overall amino acid similarity is weak - contains 2 
(splice) variants of both Ce-unc-53 and Hs-unc-53/3. 
5 In the worm, the presence or absence of these 2 exons 
in unc-53 regulates the function of the UNC-53 protein 
in such a way that cells differentially translate 
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extra-cellular signal gradient as an attractive or 
repulsive signal . The most 3' -variant of Hs-unc-53/2 
roughly covers the 2 Ce-unc-53 variants. 

The complexity of variation in this zone of Hu- 
5 UNC-53 might resemble the situation in the nematode. 

In Hs-unc-53/3, for example, the region from position 
3795 to 4325 (figure le) consists of two adjacent 
blocks (3795 to 4283 and 4286 to 4325 in figure le) 
that can independently be present in or absent from 

10 cDNAs from frontal cortex tissue. In contrast, no 

variants were as yet observed in this zone for Hu-UNC- 
53/1 or /2. 

The second variant in Hs-unc-53/3 (fig. 2) 
deletes a box (MQLDNRTLPKKGLR) , which is extremely 

15 conserved (in bold) among all human unc-53s. This 
occurrence of this variant could indicate 
differentially active functional variants of Hu- 
unc53/3. 

A second region in which splice variants were 

20 observed contains a major highly conserved domain of 
unc-53s. Hs-unc-53/1 has a first variant that 
comprises the most N-terminal portion of this 
conserved domain (SGSFRD) . A second splice variant in 
Hs-unc-53/1 (AEERMOSE) lies within the highly 

25 conserved domain. Another conserved spot for splice 
variation in human unc-53s has been found (figure 2) : 
Hs-unc-53/1 { VYE } ; -/2 { VNE } and -/3 {NSRGSEL} . All 
these spliced exons are flanked by two conserved 
charged domains - putative nuclear localisation 

30 signals. Given this conservation, we searched for 

splice variation in C. elegans and found it to exist 
in the form of an extra exon (ALSVDSQ) (figure 2) . 
Hu-unc-53/3 has another variant 
( S P L VW P PKKRQNG P V I YKH S R ) (fig. 2). 

35 The most 3' splice variant in Hs-unc-53/3 has 

been discovered whilst cloning Hs-unc-53/3 and was 
shown to be present uniquely in human heart cDNA 
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libraries . 

Single nucleotide polymorphisms 

5 Cloning and PCR studies indicated the existence 

of a non-silent single nucleotide polymorphism in Hs- 
unc-53/1 in position 1232 and in Hs-unc-53/2 in 
position 929. This indicated that variations exist in 
human unc-53s which - in some cases - may be relevant 
10 to the proper functioning of the UNC-53 protein and 
hence in disease . 

Expression in normal and neoplastic cells by RT- 
PCR 

15 

The cloning efforts demonstrated the existence of 
splice variants in the human unc-53s and the Northern 
blots revealed a range of transcripts for each human 
unc-53. The combined data do not explain completely 

20 the range of transcripts observed. Therefore, our 
understanding of the expression complexity of human 
unc-53s may be incomplete and more detailed RT-PCR 
studies were performed. 

One of the obscuring factors could have been that 

25 all studies performed on mRNA or cDNA of whole tissues 
which are built of different normal human cell types 
that occur in different proportions. For this reason 
and because skin was not covered in the Northern blot 
studies, a RT-PCR study was set up using cDNA 

30 preparations of the different cells in skin normal 

human: (1) epidermal keratinocytes, (2) melanocytes, 
(3) dermal fibroblasts. In addition, lineage matched 
transformed cell lines or tumour cell lines were 
included in the study to compare normal versus 

35 neoplastic cells. Human umbilical vein endothelial 
cells (HUVEC) were taken as a normal human match for 
endothelial cell lines. 
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The RT-PCR study for Hs-unc-53/1 revealed that 
the most 5' -splice variant is differentially expressed 
in normal versus neoplastic cells/cell lines. This 
exon is present in 7/7 keratinocytes, HUVEC and in 
5 melanocytes but lacking in HaCat, ECV304, 2/7 melanoma 
and MCF-7 cells (breast carcinoma) . 

The RT-PCR study for Hs-unc-53/2 revealed a more 
surprising picture. The tumourigenic endothelial line 
ECV304 lacks expression of Hs-unc-53/2, whereas their 

10 normal counterpart HUVEC expresses Hs-unc-53/2, 

suggesting gene deletion or inactivation of expression 
in ECV304. In epidermal keratinocytes and the lineage 
matched spontaneously transformed keratinocyte HaCaT 
and MCF-7 lack expression of the 5' -end of Hs-unc- 

15 53/2, but express the 3' end (starting in or near the 
microtubule-binding domain) . This suggests that like 
AB023155 for Hs-unc-53/3, also Hs-unc-53/2 can be 
expressed as a truncated 3' -variant in a cell-specific 
way. Also splice variation of Hs-unc-53/2 appears to 

20 differ in a normal to neoplastic way: the { VNE } exon 
was shown to be present in all keratinocyte isolates 
but not in HaCaT and also melanocytes express it, but 
not 2/7 melanoma or MCF-7. The RT-PCR studies for Hs- 
unc-53/3 were focussed on demonstrating expression of 

25 AB023155 in tissues other than brain. The new exon 
described was shown to be present in keratinocytes, 
HUVEC, dermal fibroblasts, melanocytes and their 
transf ormed/neoplastic variants, demonstrating its 
wide expression in tissues in man. 

30 

Alternative 5' -start exons 

For Hs-unc-53/2 five different start exons have 
been cloned using RT-PCR, three of which have been 
35 confirmed to be present in at least 2 different cDNA 
libraries (figure lb, c) . Likewise for Hs-unc-53/3 
different 5' -exons were found, two of which were 
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confirmed (figure le, f ) . These 5'-exons most 
probably indicate that human unc-53s are being 
expressed via the control of alternative promoters 
that lie 5' of these different 5'-exons. Also in the 
5 nematode has been shown that different (intronic) 

promoters are driving the expression of 5' -variants of 
C. elegans unc-53. 

The Hs-unc-53/1 5' -end 

10 

Despite considerable efforts, cloning has not 
lead to the identification of a bona fide 5' -end for 
Hs-unc-53/1 that comprises an F-actin binding domain, 

15 existence of transcripts > 9.5 kb . Given that both 
Hs-unc-53/2 and -/3 are expressed as full length and 
truncated forms, the question can be raised whether 
Hs-unc-53/1 may not be expressed in a short form as 
well * 

20 cDNA library cloning and 5' -RACE has provided 

contiguous sequence that ends at a position that 
matches with a domain in C. elegans un-53, where an 
alternative start position lies. Based on this 
argument, Hs-unc-53/1 could be a functional equivalent 

25 in man of this transcript in nematode. 

To further trace the "longer" variants of Hs-unc- 
53/1, genomic BAC DNA sequencing has been performed. 
In figure lg is shown sequence of a4984 fragment from 
BAC 585E09. It comprises sequence 5' of the presently 

30 known cDNA of Hs-unc-53/1. To the qualified as well 

as by means of two groups of gene structure prediction 
computer programs, different but comparable exons in 
the 4 984 bp genomic sequence fragment can be predicted 
(figure 14) . The programs GENSCAN, HEXON and MZEF all 

35 predict an exon between bp 1089 and bp 1880. The end 
of this predicted exon (bp 1880) is confirmed by the 
cDNA sequence. Therefore this predictions has a big 



BNSDOCID: <WO 9963080A1 J_> 



WO 99/63080 




PCT/EP99/03848 



- 39 - 

change to indicate the correct exon length* The 
programs GRAIL, GENE FINDER and HMMGENE all predict -an 
exon between bp 1123 and bp 2031. None of the 
predicted exons contains an in frame stop codon 5' of 
5 the alternative start codon. Consequently, it is 

possible that there exist unidentified exons 5' of the 
exon containing the alternative start codon. 

The present picture critically suggests that both 
nematode and human unc-53s appear to be complex 
10 transcriptional units. Moreover, the fact that some 
of the most complex splice variants map to similar 
regions in the UNC-53 proteins points to evolutionary 
conserved functional variants of UNC-53s e.g. with 
regard to the cells directional migration towards or 
15 away from a signal source. In contrast, some of the 
variants in the human UNC-53s are located in highly 
conserved domains; these (and other) variants may 
create discrete - yet undiscovered - functionally 
different UNC-53 proteins transcribed from one of the 
20 unc-53 genes. 

The fact that two and maybe three human unc-53s 
exist as full size and a truncated forms with cell- 
specific expression, that series of alternative 5'- 
start exons exist eventually controlled by different 
25 promoters that some forms of splice variation are 

conserved from nematode to man, all indicate that the 
expression of unc-53s is of very high complexity and 
that some of the biological functions of UNC-53 
proteins are extremely conserved. 
30 On the other hand, the differential expression in 

Northern blots, the splice variation difference 
between normal and lineage-matched neoplastic cells 
and the non-silent single nucleotide changes in two of 
the three human unc-53s, all indicate how important a 
35 wide range of diagnostic assays can be to understand 
in depth the role in disease of human unc-53s. 
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Chromosomal localization of Hs-unc-53/2 by 
Genemap98 (Fig. 13 and 1(c)) 

The EST clones AA918601, AI248585, AA115014 and 
AA115015 are clearly homologous to the 3'-UTR of Hs- 
Unc-53/2 cDNA (Figure 1(c))). Although, AA115014 
(describing the same EST as AA115015) contains an 
alternative splice variant of the Hs-Unc53/2 gene in 
the 3'UTR. A survey with ESTs AA918601, AI248585, 
AA115014 or AA115015 as query in the genemap98 
database (release November 1998) revealed that the Hs- 
Unc53/2 gene is located at chromosome 11 
(http^//www.ncbi .nlm.nih. gov/genemap98/loc . cgi?ID=2122 
4) . The STS which is used for chromosomal 
localization and which is situated in the 3'UTR of the 
Hs-Unc53/2 gene is referred to as SHGC-33456 (dbSTS 
Id: 41891, Genbank Acc: G28036, Genbank gi : 1396755) 
(Figure 13a) . The STS was localized by analysis on 
the NIGMS human/rodent somatic cell hybrid panel 
(dbSTS Id: 41891) . The Radiation hybrid results are 
summarized in Figure 13b. Together these data imply 
that every disease or phenotype connected to SHGC- 
33456 is due to the Hs-Unc-53/2 gene. 

Functional Characterisation of Hs-unc-53/3 

F-actin reorganisation and microtubule binding of 
Hs-unc-53/3 

Based on its structural features, Hs-unc-53/3 can 
be classified as a bona fide human unc-53. To further 
understand its function and in anticipation of 
developing pharmacological compound screening assays, 
Hs-unc-53/3 has been physically cloned following the 
method described in the experimental section and shown 
in figure 7a. The derived Hs-unc-53/3 clones 
comprising full length (A to L and the 3' -half (G to 
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L) of Hs-unc-53/3 were further engineered to form a 
chimera with green fluorescent protein and cloned into 
expression vectors appropriate for transfection of 
eukaryotic cells. The nucleic acid and amino acid 
5 sequences of these constructs are shown in figure 7b- 
e. The constructs were transfected into cells and 
scored for their effects on the F-actin cytoskeleton 
and binding to microtubules of mouse neuroblastoma 
cells N4; functions known for nematode unc-53 and 

10 human unc-53/1. 

The N4 cell transfected with a GFP fusion to the 
3' -half of Hs-unc-53/3 (pGI3303, fig. 7b) showed 
pronounced filopodia and lamellipodia outgrowth, which 
is associated with reorganization of the F-actin 

15 cytoskeleton (Figure 8) . This observation 

demonstrates that like nematode unc-53 and human unc- 
53/1, the F-actin binding domain is not required for 
inducing reorganization of the F-actin cytoskeleton of 
N4 cells. In addition, the pGI3303 encoded fusion 

20 protein does not co-localize with microtubuli but 

localizes to the cytoplasm of N4 cells indicating that 
an important domain for microtubuli association is 
missing in this C-terminal fragment of Hs-unc-53/3. 
In the alignment figure 2 can be seen that the C- 

25 terminal half of Hs-unc-53/3 (approximate KLAA0938) 
does not comprise the conserved microtubule binding 
domain. 

In contrast, the N4 cells that expressed low to 
medium levels of the GFP fusion to full length Hs-unc- 

30 53/3 (pGI3305, Fig. 7d) displayed a co-localization of 
the GFP fusion protein with microtubules (Figure 9) . 
Even the centrosomes could clearly be detected in some 
transfected cells. Cells expressing very low amounts 
of the fusion protein displayed specific microtubule 

35 (+)-end binding (Figure 9). The morphology of the 

PGI3305 transfected N4 cells does not clearly differ 
from the control transfected cells although there is a 
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tendency towards rounding up of the pGI3305 
transfected cells and filopodia outgrowth. 

Validation of functional assays as compound 
screens 

R74288 has previously been shown to be an 
inhibitor of nematode function in C. elegans 
(W096/38555) , an activity that has been confirmed in 
Ce-unc-53 transfected N4 cells, where only the 
transgene-induced effect was inhibited by R74288. In 
order to confirm compound R74288s activity in a full 
mammalian system, a stable transfection of plasmid 
pGI3150 was performed in the N4 neuroblastoma ceil 
line with the lipof ectamin procedure (Gibco BRL) . 
PGI3150 expresses an eGFP protein in fusion with the 
C-terminal end of Hs-unc-53/1 (see Figure 15a) . After 
two weeks of G418 selection, 20 clones with stable 
integration of the pGI3150 plasmid were selected and 
isolated. These clones were tested for GFP expression 
by fluorescence microscopy and by Western blotting 
with an anti-GFP antibody (table 1) . The lamellipodia 
outgrowth phenotype was checked visually (See Figure 
15b) . Compound R74288 was tested on four random 
selected pGI3150 stably transfected clones: 8.1, 8.2, 
8.3 and 10.1 and on a pool of pEGFPCl stable 
transfected N4 control cells. Clones 8.2 and 10.1 
displayed less lamellipodia outgrowth than clones 8.1 
and 8.3. Compounds and solvents were added to the 
stably transfected cells (10^ in DMSO) . After 24 hrs 
of incubation, two persons independently scored the 
effect of the treatments on the cells. As shown in 
table 1, both persons noticed an effect compound 2 on 
clones 8.2 and 10.1 with a weak transgene-induced 
lamellipodia phenotype. This effect consisted of a 
more flat morphology of the treated versus untreated 
cells. Compound 2 was R74288. 
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Table 1. Effect of compounds on lamellipodia 
formation 



Clone 


Compound 


Compound 


Compound 


Compound 


GFP 


GFP 


Phenotype 




X 


2 


3 


4 


fluo 


Western 




8.1 


0 


0 


0 


toxic 


+ 


+ 


+ 


8.2 


0 


+ 


0 


toxic 


++ 


+++ 




8.3 


0 


0 


0 


toxic 


++ 


++ 


++ 


10.1 


0 


+ 


0 


toxic 


+ /- 


+ 


+/- 


GFP pool. 


0 


0 


0 


toxic 









10 

Automated compound screening by measuring cell 
morphology 

15 Compound screening assays must have a 

sufficiently high throughput to be relevant to drug 
discovery. To achieve this goal, we automated the 
procedure of measuring the morphological changes 
induced in cells following transient transfection wit! 
20 full length or 3' -half of Hs-unc-53/3 GFP chimeras. 

The cell culture, transfection, fluorescence staining 
and microscopy procedures are performed within a 96- 
well plate (all-in-one) . The fluorescent staining 
method comprises a triple fluorescent labeling 
25 procedure (1) for cell nucleic using DNA double helix 
intercalating dyes such as Hoechst 33342 or DAPI, (2) 
for transfection efficiency and expression level of 
the chimeric protein using GFP fluorescence and (3) 
for the F-actin cytoskeleton using f luorescently 
30 labeled phalloidin, a microfilament dye. 

These three different fluorescent images are 
collected using an motorised stage plus stage driver 
and a frame grabber that produces seamless composite 
images of the cells in the well. The software 
35 programs to drive this operation are known in public 
domain as "SCIL" (University of Amsterdam) . The 
seamless images are then superimposed using 
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pseudocolour for the operator to inspect the quality 
of the culture. In addition, the SCIL program was 
compiled in such a way that it: (1) identifies cells 
by means of their nucleus, (2) measures the GFP 
fluorescence intensity, (3) delineates the area of the 
F-actin (phalloidin) staining surrounding a nucleus and 
(4) calculates a range of parameters objectively 
representing the features of the F-actin staining 
pattern of each individual cell. One example of such 
a parameter is called the "form factor". it is an 
arbitrary value that reflects the dendricity of a 
cell. It is derived by calculating (A) the true 
circumference of a cell's F-actin staining area as 
seen in the image and (B) the area of the F-actin 
staining of that given cell. The ratio 4xPIx(B) 2 = the 
form factor. For a rounded cell, the form factor 
approximates 1 whereas, for a cell with increased 
filopodia and lamellipodia outgrowth, the true 
circumference will be much larger than that of a 
circle and as a result, the form factor « 1. 

In experiments it was shown that transiently 
transfected N4 cell populations indeed displayed a 
different form factor versus control cells. Both the 
median and average form factor for a cell population 
in a well were reduced following transfection with the 
3' -half of Hs-unc-53/3. More in particular, there was 
a significant decrease in the number of cells in a 
transfected culture that displayed the minimal form 
factor, suggesting that the Hs-UNC-53/3 transgene 
induced round cells in particular to become more 
dendritic (figure 16) . 

Chromosomal localisation of Hs-unc-53/3 by FISH 
indicative for a role disease 

With FISH technology using a unique fragment of 
hs-unc-53/3 we are able to localize the hs-unc53/3 
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gene on chromosome 12q21.1. Chromosome 12q21.1 is a 
region shown to be involved in autosomal dominant, 
cornea plana and closed angle glaucoma (Sigler- 
Villanueva et al., Ophthalmic Genetics 18:55-62, 
5 1997). This indicates that hs-UNC-53/3 protein may be 
involved in eye development and thus eye diseases, 
such as retinoblastomas. Neuroblastoma cell line NPG 
and liposarcoma line WDLPS and other sarcoma lines 
have amplifications in this region. The neuroblastoma 
10 amplification seems to be located more distal (12q24) 
while the liposarcoma line is located at 12q21 (Van 
Royal et al . , Cancer Genetics and Cytogenetics 82:151- 
4, 1995). Three loci related to Darier' s disease, an 
autosomal dominant genodermatosis disease 
15 characterized by epidermal acantholysis and 

dyskeratosis have been mapped in region 12q21-q24 
(Wright et al . , Journal of Investigative Dermatology 
103:665-8). 12q21 is also known to be a fragile site 
associated with the pathogenesis of non-Hodgkin' s 
20 Lymphoma (Chary-Reddy et al . , Cancer Letter 86:111-7 
1994) . Duplications related to nephroblastoma 
tumorgensis were commonly found in the 12q21-q23 
region (Austruy et al., Genes Chromosomes Cancer 
14:285-294, 1995) . In a girl with mental retardation, 
25 a conclusive disorder and clinical findings resembling 
cerebral palsy, positioning of segments from other 
autosomes adject to the band 12q21 were found 
(Biederman et al., Ann Genet 19:257-260, 1976). 
Cytogenetic analysis for myeloid leukemia showed a 
30 complex caryotype with chromosomal breakpoints at 

12q21 (Weinstein et al . , Cancer Genet Cytogenet 48:75- 
81, 1990) . Finally, analysis of complex chromosomal 
rearrangements in malformed children and from 
spontaneous abortions showed specific breakpoints at 
35 site 12q21 Gorski et al., Am J Med Genet 29:247-261, 
1997) . Most of these diseases have been shown to be 
involved with cell movement, aberrant development, or 
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cell-cell contact and neuronal tissue or neuronal 
development . 

Confirmation of FISH with Radiation hybrid panels 

5 

To confirm and refine the chromosomal 
localisation of the human unc-53s an alternative 
method for FISH has been used. Radiation hybrid (RH) 
mapping is a somatic cell hybrid technique that was 

10 developed to construct high-resolution, contiguous 

maps of mammalian chromosomes. RH mapping provides a 
method for ordering DNA markers spanning millions of 
base pairs of DNA at a resolution to easily obtained 
by other mapping methods. Some of the advantages of 

15 RH mapping are (1) distance estimated by this method 
is directly proportional to physical distance, (2) 
nonpolymorphic DNA markers, that can not be used for 
meiotic mapping, can be used for this method, and (3) 
a high resolution map that is not easily made by other 

20 methods can be obtained. 

The results of FISH and RH mapping for the three 
human unc-53s are summarised in table AA. By using 
publicly available databases (see experimental 
section) one can derive information on the correlation 

25 between FISH and RH mapping. RH Mapping was shown in 
this way to confirm the FISH data for the three unc- 
53s . 
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Table 2* RH Mapping Primers and Results 





Unc-53 


FOR 
Primer 


REV primer 


PCR Results 


Marker* 


FISH 


5 


Hs-UNC-53/1 
(BACS85E9) 


5 ' TGTGGGT 
GAGGAATGC 
TGAC 


5 ' CAGAGCTT 
GCTCTAGAGG 
AC 


51, 62, 66 


SHGC-30236 


lq31-32 




Hs-unc-53/1 
(BAC585E9) 


5'CCTGCCC 
AACATAGCA 
AGAC 


5 ' CCATCTAC 
AATGAGCCAG 
AC 


51, 62, 66 


SHGC-30236 


lq31-32 




Hs-unc-53/2 
G411 


5' CTGCCTC 
CCTTTGCTG 
TGTTGCATG 


5 ' CTGAGCAG 
AGTGAAGCCA 
GAGTTGG 


8, 28, 29, 43, 
44, 51, 59, 
66, 70, 77, 83 


AFM022th2 


llpl5. t 


1 0 


F4.1.2 


5 ' TCATGTA 
TTCCCCACA 
GACAAGCC 


5 ' CATTGTGT 
CTTGATACTT 
TGGGGTGC 


8, 28, 44, 51, 
59, 65, 83 


SHGC-31021 


llplS.l 




D4.1.1 


5 ' GAGGATT 
TTATTTCTG 
GGAAATGGA 
ATCGG 


5 ' TGATCTTC 
CACTCCGTGG 
ATAACT 


8, 27, 28, 29, 
43, 44, 51, 
59, 65, 70, 83 


AFM022th2 


llplS.l 


15 


Hs-unc-53/2 , 
J4.1.4 


5 ' AAAGCCC 
AAGCCCCGG 
GAGAAGATG 


5 ' AACCCGTT 
TTCCACCGAG 
CCGCTC 


8, 27, 28, 43, 
44, 51, 59, 
66, 70, 83 


AFM022th2 


llplS.l 




Hs-unc-53/3, 
A215 


5 ' ACTTGCT 
GAAACAGAG 
AGCTCCATG 


5 ' CTTGCTGT 
CTTCTTTCTC 
CTTGGC 


1, 48, 50, 51, 
59, 65, 66, 
73, 74, 76, 78 


SHGC-17536 


12q21.1 




Hs-unc-53/3, 
A211 


5 ' TGATCTT 
CTAGCGTGT 
GACTCACTG 


5'ATCATTCC 
TTGGAGT 


1, 48, 50, 51, 
59, 73, 76, 78 


SHGC-17536 


12q21.1 



20 (*) list not exhaustive 



Also sequence information available in public 
domain can help refine the positioning of the unc-53 
genes, like in the following example. The EST clones 

25 AA918601, AI248585, AA115014 and AA115015 are clearly 
homologous to Hs-Unc53/2 cDNA. Although, AA1 15014 
(describing the same EST as AA115015) contains an 
alternative splicevariant of the Hs-Unc53/2 gene in 
the 3'UTR. A survey with ESTs AA918601, AI248585, 

30 AA115014 or AA115015 as query in the genemap98 

database (release November 1998) revealed that the Hu- 
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unc53/2 gene is located at chromosome 11 
(http: / /www.ncbi .nlm.nih. gov/genemap98/loc .cgi?ID=2122 
4) . The STS which is used for chromosomal 
localization and which is situated in the 3'UTR of the 
5 Hs-Unc53/2 gene is referred to as SHGC-33456 (dbSTS 
id: 41891, Genbank Acc: G28036, Genbank gi : 1396755) 
(Figure 13) . The STS was localized by analysis on the 
NIGMS human/rodent somatic cell hybrid panel (dbSTS 
id: 41891) . The radiation hybrid results are 
10 summarized in Figure 13. Together these data imply 

that diseases or phenotypes connected to SHGC-33456 is 
due to the Hs-Unc53/2 gene. 

EXPERIMENTAL PROCEDURES 

15 

Cloning & sequencing of Hs-unc-53/3 

Hs-unc53/3 has been cloned starting from a series 
of ESTs that were similar but not identical to Hs-unc- 
20 53/1 or -/2. The ESTs were: 

1. WashU-Merck EST 767735. 

Transformed cells carrying the EST 767735 
25 sequence were ordered from Research Genetics. Plasmid 
DNA was isolated using standard protocols (Qiagen 
plasmid DNA isolation kit) , the sequence of the insert 
was determined . 

30 2. ATCC cDNA clones 86459. 

Transformed cells carrying the cDNA clone 
8 6459 sequence were ordered from ATCC. Plasmid DNA 
was isolated using standard protocols (Qiagen plasmid 
35 DNA isolation kit), the sequence of the insert was 
determined. 
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3. Genethon cDNA clone c09a03 from the 
Geneexpress cDNA program. 

Transformed cells carrying the cDNA clone 
5 c09a03 sequence were ordered from Genethon. Plasmid 
DNA was isolated using standard protocols (Qiagen 
plasmid DNA isolation kit) , the sequence of the insert 
was determined. 

!0 These ESTs were extended to form one ORF as 

follows : 

1. 5' extension of EST 767735 by RACE (Rapid 
Amplification of cDNA Ends) . 

15 

Marathon-Ready cDNAs (Clontech) are premade 
"libraries" of adaptor-ligated double-stranded cDNA 
ready for use as templates in RACE experiments. Five 
ml Marathon-Ready cDNA was used as template in a 

20 regular 50 ml RACE. The RACE mixture contained 1 x 
KlenTaq PCR buffer. 0.2 mM of each dNTP, 1 x 
advantage KlenTaq polymerase mix (Clontech), 0.15 mM 
API adaptor primer and 0.15 mM RACE gene specific 
primer. The amplification conditions were as follows: 

25 94°C for 30 s and 68 °C for 4 min. One-hundred- fold 
diluted RACE product was used as a template in a 
nested PCR with AP2 adaptor and gene specific nested 
PCR primers. Specific nested PCR fragments were 
cloned into pCR2 (TA cloning kit, Invitrogen) and the 

30 sequences of the inserts were determined. Gene- 
specific primer (hh3UNC53 97101702): 

5 ' ACCATTTACACCTGAAGACGATTGAGGTCC ; nested gene-specific 
primer (hh3UNC53 97101701) 

5'CTCCTATTTAAATTAGAGGCTCCCTGGACC Marathon cDNA 
35 library: human placenta, human heart, human chronic 

myelogenous leukemia, human colorectal adenocarcinoma. 
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2. 3' extension of EST 767735 by RACE . 

Method as described previously. Gene 
specific primer (hh3UNC53 97102702) 
5 5'CAATCGTCTTCAGGTGTAAATGGTAACGTG; nested gene specific 
primer (hh3UNC53 97102703) 

5'GAATGTCAAACACAGTGCCACCTCCACC Marathon cDNA library: 
human placenta, human heart, human HeLa, human 
melanoma. 

10 

3. 3' extension of cDNA clone c09a03 by RACE. 
Method as described previously, gene- 

w r^ w * *- - 1 - ^ \ J.J.JLA^ UiM^v/ J Z* U W^. \J *■* U X / 

15 5 ' AGGGAGCACTGAATGGTCCAGACCATCCTC ; nested gene-specific 
primer (hh3UNC53 98020402) 

5'GCATCAGAAGACAGCATTCCTCTGAAAGTG Marathon cDNA 
library: human placenta, human heart, human HeLa, 
human melanoma, human colorectal adenocarcinoma, human 
20 chronic myelogenous leukemia. 

4. 5' extension of cDNA clone 86459 by RACE 

(1) . 

25 Method as described previously gene-specific 

primer (hh3UNC53 98020403) 

5'TTCAATTTCTATCTCTATGAGTTTTCTTCG; nested gene-specific 
primer (hh3UNC53 98020404) 

5' GCAGCTCTAGATTTGGTGATGAAGAAACTC Marathon cDNA 
30 library: human placenta, human heart, human HeLa, 

human melanoma. Overlapping sequences were assembled 
in a single contiguous sequence. 

5. 5' extension cDNA clone 86459 by RACE (2) . 

35 

Method as described previously gene-specific 
primer (hh3UNC53 98022502) 
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5' TCAGAATGTGATGAAGGAGGCTTGGTGGAC; nested gene-specific 
primer (hh3UNC53 98022501) 

5'GGATGCCGGAAGGGATGAATCAGTAAGC Marathon cDNA library: 
human placenta, human heart, human HeLa, human 
5 melanoma, human colorectal adenocarcinoma, human 
chronic myelogenous leukemia. 

Validating variants at 5' end of the cDNA 
sequence 

10 

In the final 5' RACE experiment, 2 variants have 
been found whose sequence diverge upstream from the 
IYTDWAN protein sequence (position 289 in figure le or 
position 82 in figure If) - By using primers 

15 AT T T AC AC T G AC T G G G C C AAC and ATAATCTGGATGATTTCTGCTAGGAGT 
on cDNA clones a Hs-unc-53/3 specific PCR product was 
obtained that was radiolabeled using the random primed 
DNA labeling kit (Roche Molecular Biochemicals) and 
hybridized to human DNA BAC filters (Research 

20 Genetics) . Both primers are located near the IYTDWAN 
box. Four BACs turned out positive (415J11; 464C17, 
525C02 and 537B02) . DNA sequencing of the region 
upstream from the IYTDWAN protein sequence directly on 
these BACs showed that this region was preceded by a 

25 putative intronic sequence as evidenced by the 

multiple stop codons in the reading frame and by the 
consensus AG intron acceptor sequence. For sequencing 
purposes, BAC DNA was prepared according to a modified 
Qiagen plasmid DNA procedure. 

30 A primer pair was designed specifically to 

amplify the 5' end of the variant shown in full in 
figure le (primers AC T T G C T G AAAC AG AG AG C T C CAT G and 
CTTGCTGTCTTCTTTCTCCTTGGC) . PCR with these primers on 
BAC DNA showed the presence of the genomic sequence 

35 encoding this variant in 3 out of the 4 BACs (not 
present in BAC 415J11) . 
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BACs containing the genomic sequence encoding the 
other 5' end variant of Hs-unc-53/3 as shown as the 
variant in figure le were identified by hybridizing 
the Research Genetics human DNA GAC filters with 
5 primer TGATCTTCTAGCGTGTGACTCACTG, radioactively 

labeled using gamma- P3 2 -ATP and polynucleotide kinase . 
Positive BACs were 404F14, 450K18 and 764L15. 

Sequencing directly on the respective BACs in the 3' 
10 direction from within the 2 alternative 5' exons and 
comparison of the genomic DNA sequence with the 
previously determined cDNA sequence identified the GT 
intron donor site. Joining of the genomic sequences 

-T -w~ t-s ^ 4- K Cf ^ <ir ^-r* ^3 4-V« TVmnT.TTVM J-T — ~ 

15 after removal of the predicted intronic sequence 

restored for both variants the sequence of the 5' RACE 
experiment without affecting the translation of the 
Open Reading Frame. 

20 Cloning of Hs-unc-53/3 constructs 

With the aim of cloning the full-length Open 
Reading Frame of Hs-unc-53/3, primer pairs were 
selected such that the ORF could be amplified in 6 

25 overlapping fragments ranging in size from 1 to 2 kbp. 
Overlaps between the fragments were chosen such that 
they contain an endonuclease restriction enzyme 
recognition site suitable for cloning the full-length 
gen. For the 5' fragment, the downstream oriented 

30 primer was chosen to contain the first putative start 
codon (ATG) in variant 1 (the one shown in full in 
figure le) . PCR conditions using the Expand High 
Fidelity PCR system (Roche Molecular Biochemicals) for 
all of the fragments were as follows. Initial 

35 denaturation for 5' at 95°C; 30 cycles of denaturation 
at 95°C for 45", primer annealing at 55°C for 45" and 
extention at 72°C for 1' (3' for primer combination 
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A+B) ; followed by an additional incubation for 7' at 
72°C and storage at 4°C. PCRs were run on PE 
Biosystems 9700 PCR machines. 



Primer pairs used for cloning Hs-unc-53/3 fragments 



# 




Size 


Primer 


Sequence 






(bp) 






A- 


B 


2229 


A 


TOAGjL* 1 OQj/\(jL*/\1 Jt\l { <J{~'{~> lUi -1^1 J- UVjuU -L -L VJV>- 








B 


GGGGTGGGTCGACTTGTCAAGTGG 


c- 


■D 


847 


C 


ATGGAAGGACCATACCCAACTTGAC 








D 


CTTGTTCCAGCTTTCTGCCTAGATG 


E- 


•F 


781 


E 


C AGGT T CCT G G AG AAG AGGC AT G T C 








F 


GGT GAGGC AAT AT CTGG AT ACT T GG 


G- 


-H 


1291 


G 


AGGCAGCCAGGATCCAAGTATCCAG 








H 


T GC GAAGAT CT T TT GGG AGG AT GGT C 


I- 


-J 


1022 


I 


AACCATTGAAATGCTGAAGGCTCAG 








J 


GGTTATGGGATCTAATTAAGTCTCC 


K- 


-L 


1255 


K 


CACTAGCCTTGGTCTGAGCTCTGAC 








L 


TCACCCTCTAGAGGGTAGATTCAAG 



Primer A contains restriction sites (Xhol and 
nhel) suitable for final subcloning in an eukaryotic 
expression vector (pEGFPc3) and in a yeast-two-hybrid 

25 vector (pAS2-l) , respectively. 

PCR products were analyzed by agarose gel 
electrophoresis and were visualized by ethidium 
bromide staining. Splice variants as mentioned in 
figure le were observed as multiple bands on agarose 

30 gels. Single band PCR products were purified with the 
Qiaquick PCR purification kit, whereas multiple band 
PCR products were cut out from gel as individual bands 
and purified using the Qiaquick gel extraction kit. 
PCR products were cloned in pCR2 . 1 according to the 

35 suppliers protocol (Invitrogen) . For each fragment, 
multiple clones were picked from selective LB agar 
plates and grown overnight under antibiotic selection 
pressure for DNA preparation either on the biorot 9600 
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10 



15 



20 



25 



30 



35 
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(Qiagen) , or manually on anion exchange columns 
(Qiagen tip 20 or tip 100) . Insert sequences were 
determined using the Bigdye terminator ready reaction 
cycle sequencing kit (PE Biosystems) . Individual 
sequencing reactions for each clone were assembled in 
single sequence contigs using the Sequencher software 
package (GeneCodes) . Sequences were compared to the 
previously determined consensus sequence using the 
SeqEd software package form PE Biosystems. For each 
fragment a clone was selected containing the correct 
sequence and the splice variant of interest. For the 
I-J fragment, a clone was selected that missed the 
hart specific 22 amino acid splice variant (figure 



was cloned in the BamHI site of the pCR2 . 1 multiple 
cloning site to facilitate subcloning of the full- 
length gene into the yeast-two-hybrid vector (pAS2-l) 
and the eukaryotic expression vector (pEGFPc3) , 
respectively. 

The overall cloning strategy of the full-length 
gene is visualized in figure 7a. 7al illustrates the 
overlapping PCR fragments and the nomenclature of 
fragments and primer pairs. 7a2 illustrates the 
assembly of the 3' half of the gene in pCR2 . 1 . 
Internal BamHI (I-J fragment) and Xhol (K-L fragment) 
sites as well as restriction sites from the multiple 
cloning site of pCR2 . 1 (as shown in the figure) were" 
removed by side-directed mutagenesis (SDM) using the 
Quickchange Site-Directed mutagenesis kit 
(stratagene) . The Notl-EcoRI G-H fragment and the 
EcoRI-Nhel I-Jd22 (622 indicating that the 22 amino 
acid splice variant is absent) were directionally 
cloned in the NotI and Nhel sites of the K-L fragment 
clone. Multiple clones were picked and verified by 
DNA sequencing. 7a3 illustrates the assembly of the 
5' half. Internal Xhol (C-D fragment) and Sfil and 
Xhol (E-F fragment) sites were removed by SDM. 
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Inserts were cut out from the vectors by restriction 
digestion with the appropriate restriction enzymes 
(Xhol+Sall; Sall+Narl and Narl+BamHI, respectively) 
and purified from gel after agarose gel 
5 electrophoresis. The 3 fragments were ligated 

together, re-cut with Xhol and BamHI and separated on 
gel. The band of the expected size was cut out of 
gel, purified and cloned in front of the 3' half, 
opened by digestion with Xhol and BamHI (figure 7a4) . 
10 Multiple clones were picked and verified by 
sequencing. 

Figure 7a illustrates the modular nature of the 
cloning project. For all the possible combinations of 
splice variation within the building block fragments, 
15 one representative clone is available. In view of 

functional analysis, building blocks can be exchanged 
easily by standard technology, either in the pCR2 . 1 
construct or in the final eukaryotic expression or 
yeast-two-hybrid construct. 



20 



Construct of Hs-unc-53/3 GFP chimeras 



The construction of the mammalian expression 
vectors pGI3303 and P GI3305 is explained in the 
25 legends of figure 7a, 7b and 7d. pG13303 can be used 
to over-express in mammalian cells or animals a fusion 
protein between eGFP and 1128 AA C-terminal fragment 
of Hs-unc-53/3 (Fig 7c) . pG3305 can be used to 
overexpress in mammalian cells or animals a fusion 

30 protein between eGFP and the 2363 AA full length Hu- 

unc-53/3 (fig 7d) . The Hs-unc-53/3 cDNA in pGI3303 as 
well as in pGI3305 contains silent mutations that 
introduce or remove specific restriction sites in 
order to be able to easily subclone different types of 

35 alternative splice variants in these vectors. 
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Genomic DNA sequencing (BAC 585E09) 

Using the primers AGGACCCTATGCGGAGGTCAAGCCGC and 
TGGGTTGGCATCATCGCTGTCGTAGC, a PCR specific for Hs-unc- 
53/1 was developed. PCR products were radiolabeled 
using the Random Prime DNA labeling kit (Roche 
Molecular Biochemicals) and hybridized on the human 
genomic DNA BAC filters (Research Genetics) . Positive 
signals were obtained for BAC clones 366H21, 483L14, 
471 J09 and 585E09. BAC DNA was isolated from E. coli 
genomic clone 585E09 according to a modified Qiagen 
plasmid DNA preparation procedure. A shotgun library 
of 1920 clones was constructed at GATC (Konstanz, 
Germany) . BAC DNA was prepared, nebulized and 
subcloned after end-repairing in the sequence vector 
pTZ19R. At JRF, DNA was prepared on the Biorobot 9600 
(Qiagen) from 144 0 clones. End sequencing reactions 
with Ml 3 forward ( TGTAAAACGACGGCCAGT ) and reverse 
(CAGGAAACAGCTATGACC) primer were done on 7 68 clones. 
672 additional clones were sequenced with M13 only. 5 
1*1 DNA was used in 15 /zl final reaction volume using 
the BigDye Terminator Ready Reaction sequencing kit. 
Sequencing reactions were run on MJ Research PTC200 
PCR machines. Reaction products were run and analysed 
on PE ABI 377 DNA sequencers. All sequencing results 
were imported in the Sequencher (GeneCodes) software 
package. Contaminating vector sequences and trailing 
sequences of low quality were trimmed. Individual 
sequences were assembled in contigs with standard 
software settings. A great number of contigs were 
constructed ranging from below 500 bp to over 10 kbp. 
Singletons are also still present. By looking for 
strings of known sequence, a contig was found 
containing the known and reliable 5' end of hUNC53hl 
and extending this sequence in 5' direction. This 
sequence and its relevant features are described in 
figure lg and its legend. 
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Northern blotting 

A Human multiple tissue Norther- (MTN-1, 
Clontech) containing in each lane 2 mg of poly A + RNA 
5 from eight different human tissues (heart, brain, 

placenta, lung, liver, skeletal muscle, kidney, and 
pancreas) and a MTN-II human multiple tissue Northern, 
containing in each lane 2 mg of poly A + RNA from 
spleen, thymus, prostate, testis, ovary, small 

10 intestine, colon and peripheral leukocyte, were 
hybridized according to the manufacturer's 
instructions and washed out in 0 . lxSSC: 0 . 2% SDS at 
55°C, Also from Clontech, a poly A + RNA blot from 
human cancer cell lines (melanoma G361, lung carcinoma 

15 A549, colorectal adenocarcinoma SW480, Burkitt's 

lymphoma Raji Leukemia Molt 4, lymphoblastic leukemia 
K562, HeLa S3 and promyelocytic leukemia HL60) was 
tested. 

20 Cancer cell lines RNA blots probed with Hs-unc- 

53/3 

A set of cancer cell line Northern blots were 
probed with a 665 bp fragment of Hs-unc-53/3 amplified 

25 by using the primers 5' AGGAATTAAAATTAACGGATATTCGG and 
5' AAAACTGTCCAAACTATTTTCTTCTACC. HU-unc-53/3 is 
expressed in Melanoma G3 61 and lung carcinoma A54 9, 
transcripts sizes were detected of >0.5 kb. No 
expression was detected in promyelocytic leukemia HL- 

30 60 HeLa cell S3, chronic myelogenous leukemia K-562, 
leukemia MOLT-4, Burkitt's lymphoma Raij and 
colorectal adenocarcinoma SW480. 

Normal human tissue RNA blots probed with Hs-unc- 
35 53/3 

A set of normal human tissue Northern blots were 
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probed with a 665 bp fragment of Hs-unc-53/3 amplified 
by using the primers 5' AG GAAT T AAAAT T AAC GGAT AT T C G G and 
5' AAAACTGTCCAAACTATTTTCTTCTACC . High expression 
levels were detected in brain, spleen, ovary and 
spinal cord, lower levels in heart, placenta, testis, 
stomach, and adrenal gland. Transcripts sizes were >= 
9.5 kb. 



10 



FISH 

Hs-UNC-53/3 is localised to chromosome 12q21.1 
Slides preparation : 

15 Lymphocytes isolated from human blood were 

cultured in a-minimal essential medium (MEM) 
supplemented with 10% foetal calf serum and 
phytohaemagglutinin (PHA) at 37°C for 68-72 hr. The 
lymphocyte cultures were treated with BrdU (0.18mg/ml 

20 Sigma) to synchronise the cell population. The 

synchronised cells were washed three times with serum- 
free medium to release the block and recultured at 
37°C for 6 hr in a a-MEM with thymidine (2.5//g/ml: 
Sigma) . Cells were harvested and slides were made by 

25 using standard procedures including hypotonic 
treatment fix and air-dry . 

In situ hybridisation and FISH detection: 

30 A cDNA probe was biotinylated with dATP using the 

BRL BioNick labelling kit (15°C, 1 hr) Heng et al, 
1992) . The procedure for FISH detection was performed 
according to Heng et al * , 1992 & Heng and Tsui, 1993, 
Heng et al..: Proc Natl Acad Sci USA 89: 9509-9513 

35 (1992). Heng et al . Chromosoma 102: 325-332 (1993). 

Briefly, slides were baked at 55°C for 1 hour. After 
RNase treatment, the slides were denatured in 70% 
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formamide in 2xSSC for 2 min. at 70°C followed by 
dehydrated with ethanol . Probes were denatured at 
75°C for 5 min. in a hybridisation mix consisting of 
50% formamide and 10% dextran sulphate. Probes were 
5 loaded on the denatured chromosomal slides. After 
over night hybridisation, slides were washed and 
detected as well as amplified. FISH signals and the 
DAPI banding pattern were recorded separately by 
taking photographs, and the assignment of the FISH 
10 mapping data with chromosomal bands was achieved by 
superimposing FISH signals with DAPI banded 
chromosomes (Heng et al, 1993) . 

Results 

15 

Under the condition used the hybridisation 
efficiency was approximately 67% for this probe (among 
100 checked mitotic figures, 67 of them showed signals 
on one pair of the chromosomes) . Since the DAPI 
20 banding was used to identify the specific chromosome, 
the assignment between signal from probe and the long 
arm of chromosome 12 was obtained. The detailed 
position was further determined in the diagram based 
on the summary from 10 photos. 

25 

Radiation Hybrid Mapping 

Radiation hybrid analysis is a PCR technique and 
the panels of radiation hybrid DNA are provided at a 

30 concentration of 25 in TE buffer suitable for 

these reactions. Typically, 25 ng of DNA is used in a 
10 fil PCR reaction. 

Some of the radiation hybrid panels are supported 
by an e-mail server which can assist you in the 

35 chromosome localization of markers. A server for the 
chromosome localization of markers using the Stanford 
G3 and Stanford TNG panels is available at http://www- 
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shgc.stanford.edu. At the time of catalog 
publication, the Stanford TNG server was capable of 
chromosome localization only on chromosomes 2, 4, 7 
and 21. Chromosome localization of markers from the 
GeneBridge4 panel may be performed by accessing the 
server at http://www-genome.wi.mit.edu. RH mapping 
involves the statistical analysis of several to many 
markers to determine the relative order of the markers 
with respect to one another. RH mapping can be 
achieved using statistical programs that will provide 
the best map along with a measure of the relative 
likelihood of one order versus another. 

This type of analysis has been shown to 
successfully generate the order of markers on the RH 
map that is significantly more likely than any 
alternative order. Two statistical programs for RH 
mapping can be downloaded from the World Wide Web free 
of charge. SAMapper was produced at the Stanford 
Human Genome Center and be downloaded at http://www- 
shgc . Stanford . edu/Mapping/SAMapper/ index . html RHMAP 
was written by Michael Boehnke at the University of 
Michigan and can be downloaded at 

http : //www . sph . umich . edu/ group/ statgen/ software . A 
comprehensive web page regarding radiation hybrid 
mapping, with links to web sites with analysis 
software and other information, can be found at 
http: //linkage. rockefeller .edu/ tara/rhmap/ 

Transfection protocol for cells 

N$ neuroblastoma lines were seeded in Lab Tek 
chambered coverglass (Nalgene Nunc International) and 
transfected with pEGFP (control), pGI3303 and pGI3305 
using lipof ectamine (Life Technologies BRL) . After 
24-48 hours, the chambered coverglasses were placed on 
an inverted fluorescence microscope where GFP 
fluorescence could be visualized in living cells. The 
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details of this method have been described in 
PCT/EP96/02311. 

Microscopy and fluorescence staining using 
phalloidin 

have been described earlier (EP97/06956) . 
SEQUENCE LISTING 

Seq ID No 1 is a nucleic acid sequence of Hs unc-53/1 
and lacking the nucleotides from position 2873 to 3043 
shown in Fig. la. 

15 Seq ID No. 2 is a nucleic acid sequence of Hs unc-53/1 
and lacking the nucleotides from position 3098 to 3121 
shown in Figure la. 

Seq ID no. 3 is a nucleic acid sequence of Hs-unc-53/1 
20 and lacking the nucleotides from position 3518 to 3526 
of the sequence identified in Fig. la. 

Seq ID No. 4 is an amino acid sequence of Hs-unc-53/1 
protein and lacking the amino acids from position 958 
25 to 1014 of the sequence identified in Fig. lb 

Seq ID No. 5 is a amino acid sequence of Hs-unc-53/1 
protein and lacking the amino acids from position 1033 
to 1040 of the sequence identified in Fig. lb. 

30 

Seq ID No. 6 is a amino acid sequence of Hs-unc-53/1 
protein and lacking the amino acids from position 1173 
to 1175 of the sequence identified in Fig. lb. 

35 Seq ID No. 7 is a nucleotide sequence encoding Hs- 
unc-53/2 and lacking the nucleotides from position 
5425 to 5433 of the sequence illustrated in Fig. 1c. 
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Seq ID No. 8 is a nucleotide sequence encoding Hs- 
unc-53/2 and lacking the nucleotides from position 
5924 to 6024 of the sequence illustrated in Fig. lc, 

5 Seq ID No. 9 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 1 
illustrated in Fig. lc. 

Seq ID No. 10 is a nucleotide sequence encoding Hs- 
10 unc-53/2 and having the sequence of variant 2 
illustrated in Fig. lc. 

Seq ID No. 11 is a nucleotide 'sequence encoding "Hs- 
unc-53/2 and having the sequence of variant 3 
15 illustrated in Fig. lc. 



Seq ID No. 12 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 1 
illustrated in Fig. lc. and lacking the nucleotides 
20 from position 5425 to 5433 of the sequence illustrated 
in Fig. lc. 

Seq ID No. 13 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 1 
25 illustrated in Fig. lc. and lacking the nucleotides 
from position 5924 to S024 of the sequence illustrated 
in Fig. lc. 

Seq ID No. 14 is a nucleotide sequence encoding Hs- 
30 unc-53/2 and having the sequence of variant 2 

illustrated in Fig. lc. and lacking the nucleotides 
from position 5425 to 5433 of the sequence illustrated 
in Fig. lc. 

35 Seq ID No. 15 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 2 
illustrated in Fig. lc. and lacking the" nucleotides 
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from position 5924 to 6024 of the sequence illustrated 
in Fig. lc. 

Seq ID No. 16 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 3 
illustrated in Fig. lc. and lacking the nucleotides 
from position 5425 to 5433 of the sequence illustrated 
in Fig- lc. 

Seq ID No. 17 is a nucleotide sequence encoding Hs- 
unc-53/2 and having the sequence of variant 3 
illustrated in Fig. ...lc.^and. lacking the s nucleotides ^ 
from position 5324 "tc 6024 of the sequence illustrated 
in Fig. lc. 

Seq ID No. 18 is an amino acid sequence of Hs-unc- 
53/2 protein and lacking the amino acids from position 
1776 to 1778 of the sequence identified in Fig. Id 

Seq Id No. 19 is an amino acid sequence of variant 1 
of Hs-unc-53/2 sequence illustrated in Fig. Id. 

Seq Id No. 20 is an amino acid sequence of variant 2 
of Hs-unc-53/2 sequence illustrated in Fig. Id. 

Seq Id No. 21 is an amino acid sequence of variant 3 
of Hs-unc-53/2 sequence illustrated in Fig. Id. 

Seq Id No. 22 is an amino acid sequence of variant 1 
of Hs-unc-53/2 sequence illustrated in Fig. Id and 
lacking the amino acids from position 1776 to 1778 of 
the sequence identified in Fig. Id. 

Seq Id No. 23 is an amino acid sequence of variant 2 
of Hs-unc-53/2 sequence illustrated in Fig. Id and 
lacking the amino acids from position 1776 to 1778 of 
the sequence identified in Fig. Id. 
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Seq Id No. 24 is an amino acid sequence of variant 3 
of Hs-unc-53/2 sequence illustrated in Fig. Id and 
lacking the amino acids from position 1776 to 1778 of 
the sequence identified in Fig. Id. 

5 

Seq ID No. 25 is a nucleotide sequence encoding Hs- 
unc-53/3 as illustrated in Figure le. 

Seq ID No. 26 is a nucleotide sequence encoding Hs- 
10 unc-53/3 as illustrated in Figure le and lacking the 

nucleotides from position 3795 to 4283 of the sequence 
identified therein. 

Seq ID No. 27 is a nucleotide sequence encoding Hs- 
15 unc-53/3 as illustrated in Figure le and lacking the 

nucleotides from position 4284 to 4325 of the sequence 
identified therein. 

Seq ID No. 28 is a nucleotide sequence encoding Hs- 
20 unc-53/3 as illustrated in Figure le and lacking the 

nucleotides from position 3795 to 4325 of the sequence 
identified therein. 

Seq ID No. 29 is a nucleotide sequence encoding Hs- 
25 unc-53/3 as illustrated in Figure le and lacking the 

nucleotides from position 5153 to 5173 of the sequence 
identified. 

Seq ID No. 30 is a nucleotide sequence encoding Hs- 
30 unc-53/3 as illustrated in Figure le and lacking the 

nucleotides from position 5343 to 5408 of the sequence 
identified. 

Seq ID No. 31 is a nucleotide sequence encoding Hs- 
35 unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le. 
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Seq ID No. 32 is a nucleotide sequence encoding Hs- 
unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le and lacking the nucleotides from position 
3795 to 4283 of the sequence identified therein. 

Seq ID No. 33 is a nucleotide sequence encoding Hs- 
unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le and lacking the nucleotides from position 
4284 to 4325 of the sequence identified therein. 

Seq ID No. 34 is a nucleotide sequence encoding Hs- 
unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le and lacking the nucleotides from position 
3795 to 4325 of the sequence identified therein. 

Seq ID No. 35 is a nucleotide sequence encoding Hs- 
unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le and lacking the nucleotides from position 
5153 to 5173 of the sequence identified therein. 

Seq ID No. 36 is a nucleotide sequence encoding Hs- 
unc-53/3 having the sequence of variant 1 illustrated 
in Fig. le and lacking the nucleotides from position 
5343 to 5408 of the sequence identified therein. 

Seq ID No. 37 is an amino acid sequence of Hs-unc- 
53/3 protein as identified in the sequence of Fig. 
If. 

Seq ID No. 38 is an amino acid sequence of Hs-unc- 
53/3 protein as identified in the sequence of Fig. If 
and lacking the amino acid residues from position 1326 
to 1413 of the sequence identified therein. 

35 Seq ID No. 39 is an amino acid sequence of Hs-unc- 

53/3 protein as identified in the sequence of Fig. If 
and lacking the amino acid residues from position 1414 
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to 1427 of the sequence identified therein. 

Seq ID No. 4 0 is an amino acid sequence of Hs-unc- 
53/3 protein as identified in the sequence of Fig. If 
and lacking the amino acid residues from position 1703 
to 1709 of the sequence identified therein. 

Seq ID No. 41 is an amino acid sequence of Hs-unc- 
53/3 protein as identified in the sequence of Fig. if 
and lacking the amino acid residues from position 1768 
to 1788 of the sequence identified therein. 

Seq ID No. 42 is an amino acid sequence of Hs-unc-53 
of variant 1 identified in Figure If. 



Seq ID No. 43 is an amino acid sequence of Hs-unc-53 
of variant 1 identified in Figure If and lacking the 
amino acid residues from position 1326 to 1413 of the 
sequence identified therein. 

Seq ID No. 44 is an amino acid sequence of Hs-unc-53 
of variant 1 identified in Figure If and lacking the 
amino acid residues from position 1414 to 1427 of the 
sequence identified therein. 

Seq ID No. 45 is an amino acid sequence of Hs-unc-53 
of variant 1 identified in Figure If and lacking the 
amino acid residues from position 1703 to 1709 of the 
sequence identified therein. 

Seq ID No. 4 6 is an amino acid sequence of Hs-unc-53 
of variant 1 identified in Figure If and lacking the 
amino acid residues from position 1768 to 1788 of the 
sequence identified therein. 
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1. A vertebrate protein homologue of a UNC-53 
protein of r.. sleaans , which protein comprises an 
amino acid sequence having one or more of sequence 
blocks A, B, C, D, E, F, G, or H as illustrated in 
figure 4 or which differs from said blocks in 
conservative amino acid changes. 



2. A vertebrate protein homologue of UNC-53 
protein of r.. eleaans or a functional equivalent, 
derivative or bioprecursor therefor having an amino 
acid sequence encoded by the nucleotide sequence 
illustrated in figure 1(e) or the sequence of Figure 1 
15 e having nucleotide region from position 1 to 288 

replaced with the sequence of variant 1 illustrated in 
Figure le and or which sequences further lack any of 
the sequences form 3795 to 4283, 4284 to 4325, 5153 to 
5173 or 5343 to 5408. 

3. A vertebrate protein homologue of UNC-53 
protein of r.- Regans having an amino acid sequence as 
illustrated in figure 1(f) or an amino acid sequence 
which differs from said amino acid sequence 
illustrated in figure 1(f) by the replacement of amino 
acids 1 to 81 with the sequence of variant 1 in figure 
If and /or including deletions from position 1326 to 
1413, 1414 to 1427, 1703 to 1709 or 1768 to 1788, or 
which differs from said sequences in one or more 
conservative amino acid changes. 



4 . A cDNA molecule encoding a vertebrate 
homologue of UNC-53 protein of C. eleqans according to 
any of claims 1 to 3. 

5. A cDNA molecule according to claim 4 which 
cDNA comprises the sequence of nucleotides illustrated 
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in figure 1 (e) . 

6. A nucleic acid molecule capable of 
hybridising to the cDNA sequences according to claims 
4 or 5 under high stringency conditions. 

7. A DNA expression vector which comprises a 
cDNA molecule as claimed in claim 4 or 5. 



10 



8. A vector according to claim 7 which 
comprises a promoter of C. eleoans UNC-53 protein or a 
vertebrate homologue thereof according to any of 
claims 1 to 7 . 



15 9. A vector according to claim 8 wherein said 

promoter sequence is derived from a gene encoding a 
mouse or human homologue of a UNC-53 protein of C. 
eleoans . 



20 



10. A vector according to any of claims 7 to 9 
which further comprises a sequence encoding a reporter 
molecule . 



11. A vector according to claim 10 wherein said 
25 reporter molecule is a fluorophore. 

12. A host cell transformed or transfected with 
the vector of any of claims 7 to 11. 



30 



13. A host cell transformed or transfected with 
the vector of claims 10 or 11. 



35 



14. A host cell according to claim 12 or 13 
which cell comprises a prokaryotic cell, such as a 
bacterial cell or a eukaryotic cell such as a fungal, 
and animal, a plant or an insect cell. 
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15. A transgenic cell/ tissue or organism 
comprising a transgene capable of expressing a protein 
according to any of claims 1 to 3. 

16. A transgenic cell, tissue or organism 
according to claim 15 which comprises any of a COS 
cell, Hep G2, MCF-7 cell, N4 mouse neuroblastoma cell, 
a NIH3T£ cell, or colorectal carcinoma or human 
derived cells. 

17. A transgenic cell, tissue or organism 
according to claim 15 or 16 wherein said transgene 
comprises a vector according to any of claims 7 to 11. 

18. A transgenic cell, tissue or organism 
according to claim 15 or 17 wherein said transgene 
comprises a vector according to claim 10 or 11. 

19. A transgenic cell, tissue or organism 
according to any of claims 15 to 17 wherein said 
organism comprises any of an insect, a fungus, a non- 
human mammal, a plant or a nematode worm. 

20. A method of producing a mutant vertebrate 
non-human organism which mutation affects cell 
behaviour or the regulation of cell motility or the 
shape or the direction of cell migration, which method 
comprises inducing a mutation in the wild type gene 
encoding the vertebrate homologue of an UNC-53 

C. eleaans protein. 

21. A vertebrate protein homologue of an UNC-53 
protein of C. eleaans , according to any of claims 1 to 
3 for use as a medicament. 

22. Use of a vertebrate protein homologue of an 
UNC-53 protein of C. eleaans , according to any of 
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claims 1 to 3 in the manufacture of a medicament for 
promoting neuronal regeneration, revascularisation, 
wound healing or for treatment of chronic 
neurodegenerative diseases or acute traumatic injuries 
5 or fibrotic disease or autoimmune diseases such as 
rheumatoid arthritis and sclerosis. 

23. A pharmaceutical composition comprising a 
vertebrate homologue of an UNC-53 protein of C_s_ 
10 eleaans / according to any of claims 1 to 3 together 

with a pharmaceutically acceptable carrier, diluent or 
excipient therefor . 

o a n 1 1 1 ^ i ^ acid or cDNA molecule according to 

15 any of claims 4 to 6 or a functional fragment thereof 
for use as a medicament. 

25. Use of nucleic acid or cDNA molecule 
according to any of claims 4 to 6 in the manufacture 

20 of a medicament to promote neuronal regeneration, 

revascularisation or wound healing, or for treatment 
of chronic neurodegenerative diseases or acute 
traumatic injuries or fibrotic disease or autoimmune 
diseases such as rheumatoid arthritis and sclerosis. 

25 

26. A pharmaceutical composition comprising a 
nucleic acid or cDNA molecule according to any of 
claims 4 to 6 and a pharmaceutically acceptable 
carrier, diluent or excipient therefor. 

30 

27. A method of determining whether a compound 
is an inhibitor or enhancer of the regulation of cell 
behaviour, growth, cell shape or motility or the 
direction of cell migration, which method comprises 

35 contacting said compound with a host cell according to 
claim 12 or 14 or a transgenic cell as claimed in any 
of claims 15 to 18 and screening for a phenotypic 
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■change in said cell. 

28. A method according to claim 27 wherein said 
phenotypic change to be screened is a change in cell 
5 growth, or shape or a change in cell motility or 
filopodia outgrowth, ruffling behaviour, cell 
adhesion, contact inhibition or the length of neurite 
growth. 

10 29. A method as claimed in claim 27 wherein said 

transgenic cell is an N4 neuroblastoma cell and the 
phenotypic change to the screened is the length of 
neurite growth. 

15 30. A method as claimed in claim 27 wherein said 

transgenic cell is an MCF-7 breast carcinoma cell or 
an NIH3T3 cell and the phenotypic change to be 
screened is the extent of phagokinesis or contact 
inhibition. 

20 

31. A method of determining whether a compound 
is an inhibitor or an enhancer of the regulation of 
cell shape, cell growth or motility or of the 
direction of cell migration, which method comprises 

25 administering said compound to a transgenic organism 
according to any of claims 15 to 19 or a mutant 
organism produced according to the method of claim 20 
and screening for a phenotypic change in said 
organism. 

30 

32. A compound which is identifiable by the 
method according to claim 27 as an enhancer of the 
regulation of cell shape, or growth or motility or the 
direction of cell migration for use as a medicament. 

35 

33. Use of a compound which is identifiable by 
the method according to claim 27 as an enhancer of the 
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regulation of cell shape, or growth or motility or the 
direction of cell migration in the preparation of 
medicament for promoting neuronal regeneration, 
revascularisation or wound healing or for treatment of 
5 chronic neurodegenerative diseases or acute traumatic 
injuries or fibrotic disease autoimmune diseases such 
as rheumatoid arthritis or sclerosis. 

34. A pharmaceutical composition comprising a 
10 compound identified according to the method of any of 

claims 27 to 31 and a pharmaceutical^ acceptable 
carrier, diluent or excipient therefor. 

35. A compound which is identifiable by the 

15 method according to any one of claims 17 to 31 as an 
inhibitor of the regulation of cell motility, growth, 
or shape, or the direction of cell migration, for use 
as a medicament. 

20 36. Use of a compound according to claim 35 in 

the manufacture of a medicament for alleviating the 
spread of disease inducing cells or metastasis or loss 
of contact inhibition. 

25 37 . A pharmaceutical composition comprising the 

compound as claimed in claim 35, and a 
pharmaceutically acceptable carrier diluent or 
excipient therefor . 

30 38. A method of determining whether a compound 

is an inhibitor or an enhancer of transcription of a 
gene encoding a vertebrate homologue of UNC-53 protein 
of C . elegans , according to any of claims 1 ro 3 which 
method comprises the steps of (a) contacting said 

35 compound with a cell according to claim 13 or 18 and 

(b) monitoring the level of said reporter molecule and 
comparing the results obtained from said monitoring 
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step with a control comprising a cell according to 
claims 13 or 18, which cell has not been contacted- 
with said compound. 

39. A method as claimed in claim 38 wherein said 
reporter molecule detected is mRNA or green 
fluorescent protein. 

40. A compound which is identifiable by the 
method according to claims 38 or 39, as an enhancer of 
transcription of a gene coding for a vertebrate 
homologue of an UNC-53 protein of C. eleqans according 
to any of claims 1 to 3 or a functional fragment of 
said gene, for use as a medicament. 



41. Use of a compound which is identifiable by 
the method of claims 38 or 39, as an enhancer of 
transcription of a gene coding for a vertebrate 
homologue of an UNC-53 protein of c. eleqans according 

20 to any of claims 1 to 3 or a functional fragment of 
said gene, in the manufacture of a medicament for 
promoting neuronal regeneration, revascularisation or 
wound healing, or for treatment of chronic' neuro- 
degenerative diseases or acute traumatic injuries or 

25 fibriotic disease or autoimmune diseases such as 
rheumatoid arthritis or sclerosis. 

42. A pharmaceutical composition which comprises 
the compound of claim 4 0 and a pharmaceutical^ 

30 acceptable carrier, diluent or excipient therefor. 

43. A compound which is identifiable by the 
method of claims 38 or 29 as an inhibitor of 
transcription of a gene coding for vertebrate 

35 homologue of a UNC-53 protein of C. eleqans according 
to any of claims 1 to 3 or a functional fragment of 
said gene for use as a medicament. 
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44. Use of a compound which is identifiable by 
the method of claims 38 or 39 as an inhibitor of 
transcription of a gene coding for a vertebrate 
homologue of an UNC-53 protein of C. eleaans or a 
functional fragment of said gene, in the manufacture 
of a medicament for alleviating spread of disease 
inducing cells or metastasis or loss of contact 
inhibition. 



45. A pharmaceutical composition which comprises 
the compound of claim 43 and a pharmaceutically 
acceptable carrier, diluent or excipient therefor, 

46. A kit for determining whether a compound is 
an enhancer or an inhibitor of the regulation of cell 
motility, growth or shape or the direction of cell 
migration which kit comprises at least one transgenic 
cell as claimed in any one of claims 13 to 17 to be 
contacted with said compound and at least one cell 
according to claims 1 2 to 19 to be used as a control 
and means for contacting said compound with one of 
said at lest one transgenic cells. 

47. A kit for determining whether a compound is 
an inhibitor or an enhancer of transcription of a 
gene coding for a vertebrate homologue of an UNC-53 
protein of C. eleqans or a functional fragment of said 
gene which kit comprises at least one cell as claimed 
in any one of claims 12 to 19 and means for contacting 
said compound with said cells. 



48. A kit for determining whether a compound is 
an enhancer or an inhibitor of the activity of a 
vertebrate homologue of an UNC-53 protein of 
C. eleqans or a functional equivalent, derivative, 
fragment or bioprecursor of said vertebrate homologue 
protein, which kit comprises at least, one vertebrate 



BNSDOCID: <WO 9963080A1J_> 



WO 99/63080 




PCT7EP99/03848 



mutant non-human organism produced according to the 
method as claimed in claim 20 or a transgenic organism 
as claimed in claims 15 to 19 and a wild type of said 
vertebrate mutant organism. 

5 

49. A method identifying vertebrate 
homologues of an unc-53 gene of C. eleqans or a 
functional fragment thereof, which method comprises 
hybridizing to a DNA library a suitable 

10 oligonucleotide sequence of between 15 to 50 

nucleotides of the nucleic acid sequence encoding UNC- 
53 or a functional equivalent, derivative or 
bioprecursor thereof, under appropriate conditions of 
stringency to identify genes having statistically 

15 significant homology with the cDNA according to any of 
claims 4 or 5. 

50. A method of identifying a protein which is 
active in the signal transduction pathway of a cell of 

20 which a vertebrate homologue of an UNC-53 protein of 
c. elecrans according to any of claims 1 to 3 is a 
component, which method comprises: 

(a) contacting an extract of said cell with an 
antibody to the vertebrate homologue of the 

25 UNC-53 protein of C. eleqans, 

(b) identifying the antibody/vertebrate 
homologue complex, and 

(c) analysing the complex to identify any 
protein bound to the vertebrate homologue of 

30 UNC-53 protein of r. . elecrans other than the 

antibody. 

51. A method of identifying a further protein 
which is active in the signal transduction pathway of 
35 a cell of which a vertebrate homologue of an UNC-53 
protein according to any of claims 1 to 3 is a 
component, which method comprises: 
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(a) forming an antibody to the first 
identified protein bound to the vertebrate 
homologue of UNC-53 protein of c. eleaans in 
claim 50, 

(b) contacting a cell extract with said 
antibody and identifying the 
antibody/protein complex, 

(c) analysing the complex to identify any 
further protein bound to the first protein 
other than the antibody, and 

(d) optionally repeating steps (a) to (c) 
to identify further proteins in said 
pathway, 

15 52. A method of identifying a protein which is 

active in the signal transduction pathway of a cell of 
which a vertebrate homologue of an UNC-53 protein of 
C. eleqans according to any of claims 1 to 3 is a 
component, which method comprises: 
20 < a ) contacting an extract of said cell with 

said vertebrate homologue of an UNC-53 
protein of C, eleaans , 

(b) identifying any vertebrate homologue of 
UNC-53 protein/protein complex formed and 
25 (c) analysing the complex to identify any 

protein bound to the vertebrate homologue of 
UNC-53 protein other than the same 
vertebrate homologue of UNC-53 protein, 

30 53. A method according to claim 52 which further 

comprises contacting a cell extract with any protein 
identified from step (c) not being the same as the 
vertebrate homologue of UNC-53 protein used and 
repeating steps (b) and (c) so as to identify any 

35 further protein involved in the signal transduction 
pathway of said cell. 
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54 . A method of identifying a protein involved 
in the signal transduction pathway of a cell of which 
a vertebrate homologue of an UNC-53 protein of 
Pleaans is a component which method comprises: 
5 (a) providing an appropriate host cell 

having a DNA construct comprising a reporter 
gene under the control of a promoter 
regulated by a transcription factor having a 
DNA binding domain and an activating domain, 
10 (b) expressing in said host cell a first 

hybrid DNA sequence encoding a first fusion 
of a fragment or all of a DNA sequence 
according to claims 4 or 5 and either said 
DNA binding domain or the activating domain 
15 of the transcription factor, 

(c) expressing in the host cell at least 
one second hybrid DNA sequence encoding a 
putative binding protein to be investigated 
together with the DNA binding or activating 

2 0 domain of the transcription factor which is 

not incorporated in the first fusion, 

(d) detecting any binding of the protein 
being investigated with a protein according 
to any of claims 1 to 3 by detecting for the 

25 production of any reporter gene product in 

said host. 

55. A protein identified by the method of any 
one of claims 50 to 54 for use as a medicament. 



30 



56. Use of a protein identified by the methods 
of any one of claims 50 to 54 in the manufacture of a 
medicament for promoting neuronal regeneration, 
revascularisation or wound healing, or for treatment 
35 of chronic neurodegenerative diseases or acute 

traumatic injuries or fibrotic disease or autoimmune 
diseases such as rheumatoid arthritis and sclerosis. 
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57. A pharmaceutical composition comprising a 
protein identified by the methods of any one of clai 
50 to 54 and a pharmaceutically acceptable carrier, 
diluent, or excipient therefor. 



58 . A process for producing a vertebrate 
homologue of an UNC-53 protein of C. eleaans according 
to any of claims 1 to 3 which process comprises 
culturing the cells of any of claims 12 to 14 and 
recovering said vertebrate homologue of UNC-53 protein 
expressed. 

59. A process for producing a vertebrate 

Vt /-\m /~\ 1 /-> /->r-i t ^ ^ -P — . « T TXT/"* C *\ ._ . i _ 

-w^.-wy^c an uiv^-jj protein or c. eleaans according 
to any of claims 1 to 3 which process comprises 
culturing an insect cell transfected with a 
recombinant Baculovirus vector, said vector comprising 
a DNA insert encoding said vertebrate homologue of 
UNC-53 protein downstream of the Baculovirus 
polyhedrin promoter, and recovering the expressed 
vertebrate homologue of UNC-53 protein. 

60. A method of detecting whether a compound is 
an inhibitor or an enhancer of expression of a 
vertebrate homologue of an UNC-53 of C. eleaans 
according to any of claims 1 to 3 which method 
comprises contacting a cell expressing said homologue 
with said compound and monitoring for a phenotypic 
change compared to a control cell which has not been 
contacted with said compound. 

61. A method according to claim 60 wherein said 
cell comprises a cell according to any of claims 12 to 
19. 



5 



62. A method according to claim 60 wherein said 
cell has undergone loss of contact inhibition. 
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63. A method according to any of claims 60 to 62 
in which the compound to be tested comprises a nucleic 
acid. 

5 64. A method according to claim 63 wherein said 

nucleic acid sequence comprises an antisense DNA or 
RNA sequence . 

65. A method according to claim 64 wherein said 
10 mRNA sequence comprises 3' untranslated regions of 

mRNA encoding for said vertebrate homologue. 

66. A method according to any of claims 60 to 62 
wherein said compound to be tested comprises a protein 

15 having an amino acid sequence potentially suitable for 
inhibiting function of said vertebrate homologue. 

67. A method according to claim 66 wherein said 
protein comprises a protein identified according to 

20 any of the methods of claims 50 to 54. 

68. A pharmaceutical composition comprising a 
compound identified according to any of claims 60 to 
67 together with a pharmaceutical^ acceptable 

25 carrier, diluent or excipient therefor. 

69. A nucleic acid sequence identified according 
to the method of any of claims 63 to 65 for use as a 
medicament . 

30 

70. Use of a nucleotide sequence identified 
according to the method of any one of claims 63 to 65 
in the preparation of a medicament for the treatment 
of loss of contact inhibition or cancer which is 

35 mediated by a vertebrate homologue of an UNC-53 
protein of r. . eleaans. 
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71. Use of a nucleic acid according to claim 69 
in the preparation of a medicament for inhibiting 
expression of a gene coding for a vertebrate homologue 
of an UNC-53 protein of C. eleaans . 

72. An assay for detecting expression of a 
vertebrate homologue of UNC-53 protein of C. eleaans 
according to any of claims 1 to 3 in a vertebrate cell 
which assay comprises contacting a cell or an extract 
thereof with an antibody to said vertebrate homologue, 
which antibody is linked to a reporter molecule, 
removing any unbound antibody and monitoring for the 
presence of said reporter molecule. 

15 73. An assay according to claim 72 wherein said 

reporter molecule is an antibody conjugated with a 
suitable fluorophore or detectable enzyme. 



10 



20 



74. A method for detecting for expression of a 
gene coding for a vertebrate homologue of an UNC-53 
protein of C. eleaans according to any of claims 1 to 
3 which method comprises contacting a probe specific 
for a nucleic acid or protein sequence coding for or 
corresponding to said vertebrate homologue according 
25 to any of claims 1 to 3 with a cell extract which 

probe is linked to a reporter and analysing for the 
presence of said reporter. 
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75. A method according to claim 74 wherein said 
probe comprises a complementary sequence to a region 
of mRNA transcribed from said gene encoding said 
vertebrate homologue of UNC-53 protein. 



35 



76. A method according to claim 75 wherein said 
complimentary sequence is a 3' or 5' untranslated 
region of said mRNA. 
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77. A method according to claims 74 or 76 
wherein said reporter comprises a radiolabel. 

78. A method according to claim 74 wherein said 
5 probe comprises an antibody specific for said 

vertebrate homologue of said UNC-53 protein according 
to any of claims 1 to 3. 

79. A method according to claim 78 wherein said 
10 reporter comprises an antibody conjugated with a 

detectable fluorophore or enzyme. 

80. A method of determining whether a compound 
is an inhibitor or an enhancer of association of a 

15 vertebrate homologue according to any of claims 1 to 3 
to microtubules or plus end regions thereof, which 
method comprises :- 

(a) contacting said compound with a 
transgenic cell, tissue or organism 

20 expressing UNC-53 protein or said vertebrate 

homologue and which protein is operably 
linked to a reporter molecule, 

(b) screening for the localisation of said 
reporter molecule as compared to a cell 

25 according to step (a) which has not been 

contacted with said compound. 

81. A compound identifiable by the method 
according to claim 80. 

82. A compound according to claim 81 for use as 
a medicament . 

83. Use of a compound according to claim 81 as 
35 an enhancer of association of said vertebrate 

homologue with microtubules or the plus end region 
thereof, for use in promoting neuronal regeneration, 
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revascularisation or wound healing, or for treating 
chronic neurodegenerative diseases or acute trauma-tic 
injuries or fibrotic disease or autoimmune diseases 
such as rheumatoid arthritis or sclerosis. 

84. A pharmaceutical composition comprising the 
compound according to claims 81 or 82 and a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

85. A kit for determining whether a compound is 
an inhibitor or an enhancer of association of a 
vertebrate homologue according to any of claims 1 to 3 
with microtubules or the plus end regions thereof, 
which kit comprises at least one transgenic cell 
expressing said homologue and a reporter molecule or a 
cell according to any of claims 12 to 19 and at least 
one cell of the same cell type for use as a control 
and means for contacting said compound with one of 
said at least one transgenic cells. 

86. A composition comprising a vertebrate 
homologue according to any of claims 1 to 3 linked to 
a compound identified as an inhibitor or enhancer or 
association of said vertebrate homologue with 
microtubules or their plus end regions for use in 
targeting said compound to said microtubule or the 
plus end region thereof. 

87. A composition according to claim 86 which 
further comprises a cell transformation or 
transfecting agent. 



88. A method of targeting a protein to a cell 
microtubule or the plus end region thereof, which 
method comprises introducing into a host cell, tissue 
or organism a transgene comprising a sequence capable 
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of expressing a vertebrate homologue according to any 
of claims 1 to 3, which sequence is operably linked to 
a sequence encoding said protein to be targeted such 
that a chimeric protein is expressed and which results 
5 in targeting said protein to said microtubule or a 
plus end region thereof. 

89. A method of identifying a molecule which 
covalently modifies a vertebrate homologue of UNC-53 
10 according to any of claims 1 to 3 which method 
comprises : 

a) contacting an extract from a cell expressing 
said vertebrate homologue with a mixture of 
enzymes comprising candidate modifying enzymes in 

15 the presence of an inhibitor or covalent 

modification of a protein, 

b) identifying any covalently modified UNC-53 
protein from step a) , 

c) identifying said molecule involved in said 
20 modification step. 



90 



A method according to claim 89, wherein said 



indicator comprises 32 p, 



25 91. A method of identifying a compound which 

alleviates or enhances the toxicity of a vertebrate 
homologue according to any of claims 1 to 3, which 
method comprises contacting said compound with a cell, 
tissue or organism according to claim 18, and 

30 monitoring for the presence of said reporter molecule 
adjacent said microtubules or the plus end regions 
thereof. 

92. A vertebrate homologue of UNC-53 protein of 
35 C.elegans or a functional equivalent, derivative or 
bioprecursor therefor encoded by the nucleotide 
sequence in Figure la and which nucleotide sequence is 
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lacking in any of the nucleotide regions from position 
2873 to 3043, 3098 to 3121 or 3518 to 3526. 

93. A vertebrate homologue of UNC-53 protein of 
5 C.elegans or a functional equivalent, derivative or 

bioprecursor therefor having an amino acid sequence as 
illustrated in Figure lb and lacking in one or more of 
the regions from residues 958 to 1014, 1033 to 1040 or 
1173 to 1175, or which differs from said amino acid 
10 sequences in one or more conservative amino acid 
changes . 

94. A vertebrate homologue of UNC-53 protein of 
C.elegans or a functional equivalent, derivative or 

15 bioprecursor therefor encoded by the nucleotide 

sequence in Figure lc and which nucleotide sequence 
has from sequence position 1 to 366 replaced with any 
of the sequences identified as variants 1 to 3 of 
Figure lc and/or which sequences lack the region from 

20 position 5624 to 6024. 

95. A vertebrate homologue of UNC-53 protein of 
C.elegans or a functional equivalent, derivative or 
bioprecursor therefor having an amino acid sequence 

25 identified in Figure Id or the sequences of any of 
variants 1 to 3 replacing the amino acids from 
position 1 to 89 of the sequence of Figure Id and/or 
which sequence is lacking the amino acid sequence from 
position 1776 to 1778. 

30 

96. Plasmid pG313303 deposited under accession 
number LMBP 3936. 



35 



97. Plasmid pG13305 deposited under accession 
number LMBP 3937. 
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Figure la. Nucleotide sequence of Hs-unc-53/1 




CAGTGTGCCAAAAGAGACCCGCATGTACCCC^^^^ 2625 
AATGAGCCTCCCCAGTGCCTTCCCCAGCAGTACTCCCGTCCCCA^ 

gggaatgattcggtcaggacccttccgagaccccacgga^ 



3375 
3450 



5 

3750 



rrR^AACTGACCCTCCGGGTGGTGGTGAGGATGCCCCCGC AGCAC ATC ATC AAAGGGGACTTGAAGC 1 o 7 <; 




4575 

4650 



AGTCTCCCTCA/G^GGTCTGAAGGAGAAATGCGTCGACAGCCTGGTGTTCGAGACGCT^ 

g?IgScSca^cctcctgctgaagcaccggcgcctcgtcctctcgggccccagcgg^^^ 
?SgaccSt?gcttggccgagtacctggtggagcgctctggccg ?%l 

^AlcATCctcCAGcISTCTTGC^ 

ag^S??g^atotcccc?tggtgattctattggatgacctgagtg^ 

TG^G^C^C^C^CC^G^ A^GTATCAT AAATGTCCCTATATTATAGGTACCACC AATC AGCCTGTAAAAATGACACC 487s 



BNSDOCID: <WO 9963080A1J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/63080 



PCT/EP99/03848 



-~ * * A wv.«w x x v*™^_ x x ^^wax^-i-a^au^TTCTCCAACAACGTGGAGCCAGCCAATGGCTTCCTGGT 4950 
TCGTTACCTGAGGAGGAAGCTGGTAGAGTCAGACAGCGACATCAATGCCAACAAGGAAGAGCTGCTTCGGGTGCT 5025 
CGACTGGGTACCCAAGCTGTGGTATCATCTCCACACCTTCCTTGAGAAGCACAGCACCTCAGACTTCCTCATCGG 5100 
CCCTTGCTTCTTTCTGTCGTGTCCCATTGGCATTGAGGACTTCCGGACCTGGTTCATTGACCTGTGGAACAACTC 5175 
TATCATTCCCTATCTACAGGAAGGAGCCAAGGATGGGATAAAGGTCCATGGACAGAAAGCTGCTTGGGAGGACCC 5250 
AGTGGAATGGGTCCGGGACACACTTCCCTGGCCATCAGCCCAACAAGACCAATCAAAGCTGTACCACCTGCCCCC 5325 
ACCCACCGTGGGCCCTCACAGCATTGCCTCACCTCCCGAGGATAGGACAGTCAAAGACAGCACCCCAAGTTCTCT 5400 
GGACTCAGATCCTCTGATGGCCATGCTGCTGAAACTTCAAGAAGCTGCCAACTACATTGAGTCTCCAGATCGAGA 5475 
AACCATCCTGGACCCCAACCTTCAGGCAACACTTTAAGGGTTCGGCAATCACTGTCACCCCCGGACAGCAGAACG 5550 
CTGGCATCAGCTATCTTAGCTCCTCCTCTCCCCTCTCCTCTTTCAGAGCACTGGCTCTCCAGCCCCAGGAGGAGA 5625 
ACAGGAGGGAGGAGGAGATGAAAGAGGAGGGACAGGTTCTTGGTGCTGTACCTTTGAGAACTTCCTAGGAAGGAA 5700 
TGGTGGGGTGGCGTTTGGGAACTTGTGCCCCCTAAACACATTTACTGGCCTCCTCTAATGACTTTGGGGAAAAGA 5775 
TGATTCTGGGTCTTTCCCTTGACTTCTTGTTTCAATTACAAACTCCTGGGCTTTCTGGGGAGGGGTTCAGAAAAC 5850 
ATCAAAACACTGCAGCAGTTCCTAAATGATTCTCACAAGCAACCCTGAGAGAGACAGTCTTGTGAGGGAGATCTG 5925 
GGGGAGGCAGGAAGCTCCTCAGATTTTCTCACAGACCCTTCCCAATTCCATCACCACTGCCAACACTCGTCCGGA 6000 
ATTC 6004 

In frontal cortex, variants have been found lacking the region from position 
2873 to 3043 or the region from residues 3098 to 3121. The region from 3518 to 
3526 is absent in cDNA from Hela or colorectal adenocarcinoma tissue. All three 
regions are indicated in lower case letters in the figure above. Y at position 
3696 stands for C or T. Both nucleotides have been found to be present in cDNAs 
from different origin. 
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Figure lb. Amino Acid sequence of the protein encoded by Hs-unc-53/1 gene. 
Stretches encoded by the DNA sequences lacking in variants from frontal cortex 
are in lower case letters (residues 958 to 1014 ; 1033 to 1040 and 1173 to 
1175) . The x at position 1232 stands for Leucine or Serine, depending on the 
cDNA of origin. 

MLPKRAKAPGGGGGMAJCASAAELKVFKSGSVDSRVPGG 7 5 

KAPKGIXSKVGSKGREAPIjMSKTLSKSEHSLFQAKGSPAGG^ 150 

AVSETCKSDDELLSSKAKAQKSSGPVPSAKGQEERAFLK\n)PELVV^ 22 5 

DLRQOT,EETMSSLRGSQVTHSSLEMTCYDSDDANPRSVSSLSNRSSPLSWRYGQSSPRLQAGDAPSVGGSCRSEG 300 

TPAWYMHGERAHYSHTMPMRS PSKLSHI SRLELVESLDSDEVDLKSGYMSDSDLMGKTKTEDDDITTGWDES S S I 375 

SSGLSDASDNLSSEEFNASSSLNSLPSTPTASRRNSTIVLRTDSEKRSLAESGLSWFSESEEKAPKKLEYDSGSL 450 

KMEPGTSKWEUIERPESCDDSSKGGELKKPISLGHPGSLKKGKTPPVAVTSPITHTAQSALKVAGKPEGKATDKGK 525 

IAVKNTGLQRSSSDAGRDRLSDAKKPPSGIARPSTSGSFGY^^ 600 

VNGRKTSLDVSNSAEPGFLAPGARSNIQYRSLPRPAKSSSMSVTGGRGGPRPVSSSIDPSLLSTKQGGLTPSRLK 675 

EPTKVASGRTTPAPVNQTDREKEKAKAXAVALDSDNISLKSIGSPESTPKNQASHPTATKLAELPPTPLRATAKS 750 

FVTCPPSU^DKVNSNSLDLPSSSDTT^ 8 25 
SVPKETRMYPKLSGLHRSMESLQMPMSLPSAFPSSTPVPTPPAPPAAPTEEETEELTWSGSPRAGQLDSNQRDRN 900 
TLPKKGLRYQLQSQEETKERRHSHTIGGLPESDDQSELPSPPALPMSLSAKGQLTNIvsptaattpritrsnsip 97 5 
theaafelysgsojngstlslaerpkgmirsgsfrdpt^^ 1050 
SSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHL^ 1125 
SETTPKELRIKRQNSSDSISSI^SITSHSSIGSSKDADAKKKKKKSWvyeLRSSFNKAFSIKKGPKSASSYSDIE 1200 
EIATPDSSAPSSPKLQHGSTETASPSIKSSTxSSVGTDVTEGPAHPAPHTRLFHANEEEEPEKKEVSELRSELWE 1275 
KEMKLTDIRiEALNSAHQLDQLRETMH^ 1350 
THSFGPSLADTDLSPMTCISTCGPKEEVTLRVVVRMPPQHIIKGDL^ 1425 
KDYISKMDPASTLGLSTESIHGYSISHVKRVLDAEPPEMPPCRRGVNNISV^ 1500 

QHYISLLLKHRRLVLSGPSGTGKTYLTNRIJ^ I 575 
GIGDVPLVILLDDLSEAGSISELWGALTCKYHKCPYIIGTTNQPVKMTPNHGLHLSFRMLTFSNNVEPANGFLV 1650 

RYLRWCLVESDSDINANKEELLRVLDWW^ 1725 
IIPYLQEGAKIX3IKVHGQKAAWEDPVEWVTU3TLPWPSAQQD 1800 
DSDPLMAMLLKLQEAANYIESPDRETILDPNLQATL 1 83 5 
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Figure 1c. Nucleotide sequence of the Hs-unc-53/2 2 gene 

TAAGGCCXGGCGCCTCCTCTGCTACCCGCGCTC^ 75 

GGAAAGCCCAAGCCCCGGGAGAAGATGCCGGCCATCCTGGTC 153 

GTGCACAGCGCOaXCCC^TCC^^ 225 

AGCAAGGTGGAGGTGAGCAAGACCACCT 300 

GAGCCAGCCMGGGAGGGG^^ 375 

GACTGGGCCSUITGATTACCTAACCAAATCCGGCCACAAGCGTCTC^ 450 

GG CGTCCTCCP U GCCCAGATTATCCJVGGTTGT G GCAftATG 525 

AGATCCCAAATGATTGAAAAC^TAGATGCC^lXiCT^ 600 

TCTGCAGAAGAGATCAGGAATGGAAACCTCAAGGCCATTCTA 675 

auxytfsauxjKxrcccA^^ 750 

TCCCAGTGCCAGGCTGGCACCCCTCAGCAC^^ 825 

GCGCCACATCJU3CAGTCAAAAGCACAAGCTGAM 500 

<»TJUUlTCaUUVCCAGTCACCTC 1050 

TCCTCCCACCCCGGAATGAGTGACAATGCACCTGCTTCCTTGGA 1125 

AGTACCTCCTXXXXTC^TCCC^ 1200 

AGTGCCACGGTATCCATGCTCTCGGTCAAGCCTCCTGGGCCTGA 1275 

CCGGCCCCCAACAATCAGAAGTCCATGCTGGAAAAGCTGAAAC^ 1350 

GAGGGGCCGGGGTCCCTCGAC^ 1425 

^Gtfii^CGCCAjGTCGC^^ 1500---'- - 

CAGAGGACT' l ^ rA GCCGGGCACTGACCAACAAa 1575 

CAGCGGGAGAAGGATAAGGAGAAAAGCAAGGACCTTGCCAAGAGA 1650 

GAGGAGCCAAAACAAGACCCCAGTGGAGCAGCTGTGC 1725 

GGAATGAAGAGCATGCCCGGGAAATCCCCAAGTGCCCCAGCGCCTTC 1875 

AAGCTGAGCTraGGACTCCCCCAGCAGAAGC^^ 1950 

TCCTCAGAAGGAAAAGGCCCAGGAGGGACCACCCTGA^ 2025 

GGGACCACXXAGACCAC^IGGAAGCAATACCXT^^ 2100 

AACACTGCCACGGTTGCACCTTTCCTGTACATC 2175 

TCAACAGGTGTGAGCGTGGAGCCCAGCCACTTCACCAAGA 2250 

GATCCTGAGGCTCGGCGGCTGCGGAOUrrG^ 2325 

AGTTTAAGGGGAACTCAGGTTACACACAGCACATTGGAAACCACGT^ 2400 

GGCCGTAGCATACTCAGCTTGACAGGGAGGCC^ 2475 

CAAGCAGGAGACGCCCCCTCAATGGGCAATGGGTATCCCCCTCGAGCCA^ 2550 

TCAGGTCGCTATGTGTACTCCGCCCCTCTGAGAAGGCAGCTGGCCTCCC 2625 

GTCTCAGACAAGGCAGGAGATGAGATGGACCTGGAAGGCATCA 27CC 

GATGTTCTGAGCAAGAACATCCGGACCGATGACATTACAAGCGGATACATGACTC 2775 

ACCCGTCGCCTGAACCGGCTCCCTGATGGGATGGCTGTGGTACGGC^ 2 85 C 

CTCGGAGACGCTGACAGCTGGGACGACAGCAGCTCCGTCAGCAGC 2925 

ACTGATGACATCAACACCAGCTCCTCCATCAGCTCTTATG^ 3000 

GTGCAGACTGATGCTGAGAAGCACTCACAGGTGGAGAGGAATTC 3075 

GACGGAGGCTCAGACAGCGGCATAAAAATGGAGCCAGGTTCCAAGTGGAGGCG 315D 

GAKTCCGACAAAAGCACGTCGGGCAAGAAGAATCCTCTCATCTC 3225 

GCTCAGGTGGGCATCACCATGCCAAGGACGAAGGCTTC^ 3300 

AAAACAGACGACGCAAAGGTGTCTGAGAAAGGAAGGCTTTCTCCT 3375 

GATGCAGGCCGGAGCAGTGGTGACGAATCGAAAAAGCCCCTCCCCAGCAGC^ 3450 

AACAGCTT' I tSGGTTCAAGAAGCA.GAGTGGTTCCGCCGCCG 3525 

ACCAGCAGGTCAGCCACACTGGGCAAAATCCCAAAGTCATCTGCAC 3600 

AGTATGGATGGGGCTCAGAATCAGGATCACGGGTATCTAGC^ 3675 

TTGCCGAGGCCC^GTAAGTCCAACAGCCGGAACGGGGCTGGGAACAGGTCT 3753 

ATTAGCAGCAAGTCCGCAGGCCTGCCAGTGCCCAAACTGAGOT 3825 

CCAGGTCTGGTCAACCAAACAGATAAGGAGAAAGGCATCTCATCAGACAACGA 39 ZZ 

GTGAAAGTGAATCCGGCAGCCCAGCCTGTGTCCAGTC^ 3975 

GATGTGGCCTCTCCCACACTCCGCAGACTCTTTGGTGGGAAGCCTA 405: 

AACATGAAAAATTCGGTGGTCATCTCCAATCCTCATGCCACCATGACTCAGCAAGGTAACCT 412 = 

GGCAGTGGCGTCCTGAGCAGTGGGAGCAGCAGTCCTC^ 42: : 

GCCTCCAGCCCCAGCTCAGCCCACTCGGCCCCTTCCAAC^^ 4275 

GCAGTTAGCAAGGATGGCCTGGGCTTTCAGTCTGTCAGCAGCCTCCACACCAGCTGTGAGTCCATC 425 1 
CTCAGCAGTGGAGGGGTCCCCAGCCAC AATTCTTCC ACTGGCCTCATCGCCTCCTCCAAGGACGACTCCTTGACT 4 4 2 r 
CCCTTTGTCAGAACTAACAG7GTGAAGACCACACTGTCAGAAAGCCCTCTCTCTTCCCCTGCTGCTAGCCCTAAG 45 Z Z 

TTCTGCAGAAGTACTCTGCCCAGGAAACAGGACAGTGACCCGCACCTTGATAGGAA.CACITTGCCT 457; 

CTCAGGTATACTCCCACCTCCCAGCTTCGCA 4 £5 : 

GGCCTTCAGGACACCGCTGCCAATTCCCCCTTTTCCTCT^ 4725 
AA C TTTTCCCAGCTTGCGAGTCCCACCACTGTCACCCAGATGAGCTTGTCCAACCCGACCATGCTGAGGACTCAC 483 3 
AGCCTCTCCAATGCTGATGGGCAGTATGATCCATACACTGACAGCCGCTTCCGGAATAGCTCCATGTCCCTGGAT 437 = 
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Three variants have been found for the 5' end of the gene. For these variants, 
the sequence from position 1 to position 3 66 should be replaced by one of the 
following sequences : 

Variant 1 

TGAAGAGGTGGTGCTGATTTCCTTGGCTGGCGGGAACTCTGTCTGGCTGTTGCATGCATCACTTTTGTGTGGGTT 75 

ATTTTGTTCCTCTGTGGATTTGGAAGC ATCGCTGAAGGAGAGAGAGGATTTTATTTCTGGGAAATGGAATCGGTT 150 

TCTGAGTCCAGCCAACAGCAGAAGAGAAAGCCAGTTATCCACGGACTGGAAGATCAAAAGAGG 213 

Variant 2 

TGATACTTTGGGGTGCACATGGCTATTGATCTCTACTGCGGTTTGGCTTGTCTGTGGGGAATACATGAGCCCCGA 75 



TAACAACTGGACTTTATTGAGTGTTTACCATGCACCAAGCCCTGGGCTAAACACTTCATCTGCAGGCTGTTCGTC 75 

TTTACGGCAAACCCAGTAGGTAGGTATAACTATCCCCACTCTGCAGATGCAGAAACGGAGGCACAGAGTGTTTTG 150 

GTAGCTAAACAAGCTCACCAGGAGGCTAGAAGGTGGCCACACCTAGCTGGCCCCCCTGACTCCACCAACTGCCTC 225 

CCTTTGCTGTGTTGCATGCAAGAATGTGACTCCAAGTTTTTCCTTCCTTCTGGATCCAACTCTGGCTTCACTCTG 300 



Variant 3 



CTCAGCAACCAG 



312 
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2432 



AANYSSPQSYDSDSNSNSHrtUJJii-uoo^w 

Putative scare methanionines at ^^^JSSdS S^w"^^'^ ' seance . 
residue ac position 1018 ♦f^^^^^^L acid <E) can be incorporated. The 

Variant 1 25 
mESVSESSQQQKRKPVIHGLEDQKR 

Variant 2 19 
mAI DL YCGL AC LWG I HE P r 

Variant 3 24 
mQECDSKFFLPSGSNSGFTLLSNQ 
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150 



Figure le. Nucleotide sequence of Hs-unc-53/3. 
TAGAAGCATTTTCTTTGGCAGCAAGAAGATAATTTTATAGA^ 

AGGCAGCCAGCTGTTGGGTCAAAGCCTGTGCATACTGCTCTTCCGATACCAAATCTTGGCACTACTGGGTCACAG 
CACTGTTCTTCAAGACCTTTGGAACTTGCTGAAACAGAGAGCTCCATGCTTTCTTG^ 99 c 
ACCTGTGAATTTGGAGAGAAGAAACCCCTCCAAGGAAAAGCCAAGGAGAAAGAAGACAGC \tl 
TGGGCCAACCACTACCTAGCAAAATCAGGCCACAAGCGGCTGATCAAGGACTTGCAACAAGACATT^ „c 
GTACTCCTAGCAGAAATCATCCAGATTATTGCAAATGAAAAAGTTC * 
TCTCAGATGATTGAAAATGTTGATGTCTGCCTTAGTTTTCTAGCAGCCAGAGGGGTAA^ cor 
GCTGAAGAAATAAGAAATGGAAACTTAAAAGCCATTCTAGGGCTGTTTTTCAGTTTATC 600 
CAACACCATCAACAACAGTACTATCAGTCCTTGGTGGAACTTCAGCAGCGAGTO 675 
GAAGCCAGCCAGGCCAAAACCCAGCAAGATATGCAGTCCAGTCTGGCAGCCAGATATGCAACTCAGTCTAATCAC 750 
AGTGGAATTGCAACCAGTCAAAAAAAGCCTACTAGGCTTCCAGGGCCCTCTAGGGTGCCTGCTGCAGGAAGCAGC 825 
AGCAAGGTCCAGGGAGCCTCTAATTTAAATAGGAGAAGTCAGAGCTTTAACAGCATTGACAAAAACAAG^ 900 
AATTATGCAAATGGAAACGAAAAAGATTCCTCCAAAGGACCTCAATCGTCTTCAGGTGTAAATGGTAACGTGCAG 975 
CCTCCCAGTACTGCTGGGCAGCCTCCTGCCTCTGCCATCCCTTCTCCAAGTGCCAGCAAGCCCTGGCGCAGCAAG 1050 
TCCATGAATGTCAAACACAGTGCCACCTCCACCATGTTGACTGTAAAGCAGTCAAGTACAGCCACCTCCCCCACA 1125 
CCATCTTCAGACAGACTGAAGCCACCTGTCTCAGAAGGGGTCAAAACTGCTCCCTCAGGACAGAAATCCATGCOT 1200 
GAGAAATTCAAGCTAGTCAATGCCCGGACTGCTTTACGCCCCCCGCAGCCTCCCAGTTC^ 1275 
GGGAAGGATGATGATGCCTTTTCTGAATCTGGTGAAATGGAAGGTTTTAACAGTGG 1350 

ACAAATAGCAGTCCCAAAGTGTCACCTAAGTTGGCCCCTCCAAAAGCTGGAAGCAAAAATCTCAGCAATAAAAAG 1425 
TCTTTGCTACAGCCAAAGGAAAAAGAAGAAAAGAACAGGGACAAAAATAAAGTTTC 1500 

GAAGAGAAGGATCAGGTGACAGAGATGGCTCCAAAAAAGACCTCCAAAATTGCAAGCTTGATCCCTAAGGGCAGC 1575 
AAGACAACAGCAGCTAAGAAGGAAAGCTTAATTCCGTCTTCCAGTGGT^^ i»n 

ACAGTAAAGCAAACCATTTCACCTGGCAGCACAGCAAGCAAAGAGTCTGAGAAATTCAGGACTACCAA 1725 

CCTTCCCAGTCCTTATCTAAGCCTATAACCATGGAGAAAGCAAGTGCTTCTAGTTGTCCTGCCCCTTTGGAAGGA 1800 

AGGGAAGCTGGCCAAGCTTCTCCTTCTGGTTCCTGTACCATGACAGTGGCACAAAGCAGTGGGCAGAGCACAGGA 1875 

AATGGTGCTGTCCAACTCCCTCAACAGCAGCAACATAGCCACCCGAATACCGCGACAGTGGCACCATTCATTTAC 1950 

AGGGCACATTCAGAAAATGAAGGTACCGCTTTACCATCGGCTGACTCCTGTACCAGTCCTACAAAGATGGACTTA 2025 

TCATATAGTAAGACTGCTAAGCAGTGCCTGGAGGAGATATCTGGTGAAGGCCCTGAAACAAGAAGAATGAGAACA 2100 

GTTAAAAACATAGCAGACTTGAGGCAGAATTTAGAAGAGACTATGTCCAGTCTTCGTGGGACTCAGATAAGCCAC 2175 

AGCACCCTGGAGACAACATTTGACAGCACTGTGAC AACAGAAGTTAATGGAAGGACC ATACCCAACTTGACAAGT 2250 

CGACCCACCCCCATGACCTGGAGGTTGGGCCAGGCATGTCCGCGACTTCAGGCGGGAGATGCTCCCTCCCTGGGT 2325 

GCTGGCTATCCTCGCAGTGGTACCAGTCGATTCATCCACACAGACCCCTCGAGGTTCATGTATACCACGCCTCTC 2400 

CGTCGAGCTGCTGTCTCTAGGCTGGGAAACATGTCACAGATTGACATGAGTGAGAAAGCAAGCAGTGACCTGGAC 2475 

ATGTCTTCTGAGGTCGATGTGGGTGGATATATGAGTGATGGTGATATCCTTGGGAAAAGTCTCAGGACTGATGAC 2550 

ATCAACAGTGGGTACATGACAGATGGAGGACTTAACCTATATACTAGAAGTCTGAACCGAATACCAGACACAGCA 2 625 

ACTTCCCGGGACATCATCCAGAGAGGGGTTCACGATGTGACAGTGGATGCAGACAGCTGGGATGACAGCAGTTCA 2700 

GTGAGCAGTGGTCTCAGTGACACCCTTGATAACATCAGCACTGATGACCTGAACACCACATCCTCTGTCAGCTCT 2775 

TACTCCAACATCACCGTCCCCTCTAGGAAGAATACTCAGCTGAGGACAGATTCAGAGAAACGCTCCACCACAGAC 2850 

GAGACCTGGGATAGTCCTGAGGAACTGAAAAAACCAGAAGAAGATTTTGACAGCCATGGGGATGCTGGTGGCAAG 2925 

TGGAAGACTGTGTCCTCTGGACTTCCTGAAGACCCCGAGAAGGCAGGGCAGAAAGCTTCCCTGTCTGTTTCACAG 3000 

ACAGGTTCCTGGAGAAGAGGCATGTCTGCCCAAGGAGGGGCGCCATCTAGGCAGAAAGCTGGAACAAGTGCACTC 3075 

AAAACACCCGGGAAAACCGATGATGCCAAAGCTTCTGAGAAAGGAAAAGCTCCCCTAAAAGGATCATCTCTACAA 3150 

AGATCTCCTTCAGATGCAGGAAAAAGCAGTGGAGATGAAGGGAAAAAGCCCCCCTCAGGCATTGGAAGATCGACT 3225 

GCCACCAGCTCCTTTGGCTTTAAGAAACCAAGTGGAGTAGGGTCATCTGCCATGATCACCAGCAGTGGAGCAACC 3300 

ATAACAAGTGGCTCTGCAACACTGGGTAAAATTCCAAAATCTGCTGCCATTGGCGGGAAGTCAAATGCAGGGAGA 3375 

AAAACCAGTTTGGACGGTTCACAGAATCAGGATGATGTTGTGCTGCATGTTAGCTCAAAGACTACCCTACAATAT 3450 

^J^^ 0000 ^^ 3525 
AGTATTGATTCCAACGTCAGCAGC^ 3 6 0 0 

TCAGGGCGCTCGAGTCCTGTCACCGTCAACCAAACAGACAAGGAAAAGGAAAAAGTAGCAGTCTCAGATTCAGAA 3675 
AGTGTTTCTTTGTCAGGTTC^ 3750 

CCAGGATCCAAGTATCCAGATATTGCCTCACCCACATTTCGAAGgttgt ttggtgccaaggcaggtggcaaatct 3825 
gcctctgcacctaatactgagggtgtgaaatcttcctcagtaatgcccagccctagtaccacattagcgcggcaa 3900 
ggcagtctggagtcaccgtcgtccggtacgggcagcatgggcagtgctggtgggctaagcggcagcagcagccct 3975 
cccttcaataaaccctcagacttaactacagatgttataagcttaagtcactcgttggcctccagcccagcatcg 4050 
gctcactctttcacatcaggtggtctcgtgtgggctgccaatatgagcagttcctctgcaggcagcaaggatact 4125 
ccgagctaccagtccatgactagcctccacacgagctctgagtccattgacctccccctcagccatcatggctcc 4200 
ccgtctggactgaccacaggcactcacgaggtccagagcctgctcatgagaacgggtagtgtgagatctactctc 4275 
^Hf?2?? atgCagCttgacaga ^ 4350 
^ GGAAGAAGAGGGCA ^^ 4425 
G * GG T TTCCCCTTC ^ 4500 
TTGTCTCARTTTAACCTTCCCGGGCCCAGCATGATGCGCTCAAACAGCATCCCAGCCCAAGACTCTTCCTTCGAT 4575 
CTCTATGATGACTCCCAGCTTTGTGGGAGTGCCACTTCTCTGGAGGAAAGACCTCGTGCC ATCAGTCATTCGGGC 4650 
TCATTCAGAGACAGCATGGAAGAAGTTC ATGGCTCTTCATTATGACTGGTGTCCAGCACTTCTTCTCTTTACTCT 4725 
rn^nlnf^ 4800 
GCTACCCTCACATCTCAGCTTTCAGCAAATGCTCACCTTGTAGCAGCTTTTGAAAAGAGCTTAGGGAATATGACT 4875 
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4950 
5025 
5100 
5175 
5250 

MCCGGATKMAATGAMTTGAM^^ „00 

^^^^^^^^S^^~^—^^ 5775 




The region fro„ position 37SS to 4325 con, 1st • Jj^-.f ^.^i^Dm "L'Ses 
from 42S4 to 4325) that in *W2*^J y c |™.5 t 5"SS heterozygous tor the region 

AACAGACATGGGAAGAATCCAGTGAGTCACAAGCTAGAAGATCAGAAGAAG 
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Figure If. Protein sequence encoded by the Hs-unc-53/3 gene 

EKEDSKIYTDWANHYI^SGHKRLIKDLQQDIADGVLLAEIIQIIANEKVEDINGCPRSQSQMIENVDVC^FIA 
^^^^ L ^ EI ^ G ^^ I ^ LFFSLSR ^ QQ Q HH QQQYYQSLVELQQRVTHASPPSEASQAKTQQDMQSSL 
££££££ S ^ SGIATSQ ^ PT ^ PGPSRVP ^ GSSS ^^ASNI^SQSFNSIDKNKPPNYANGNEKDSSKGP^ 



75 
150 
225 



„^ "I""""" "*"«^' 1 ^ t « rj ^ vr ^^aoA.v«v»Ai>wiJ 1 JKKSyaraSXDKNKPPNYANGNEKDSSKGPO 300 

SSSGVNGNVQPPSTAGQPPASAIPSPSASKPWRSKSMNVKHSATSTMLTVKQSSTATSPTPSSDRLKPPVSEGVK 37^ 

TAPSGQKSMLEKFKLVNARTALRPPQPPSSGPSDGGKDDDAFSESGEMEGFNSGI^SGGSTNSSPKVSPKLAPPK 450 

AGSKNLSNKKSLLQPKEKEEKNRDKNKVCTEKPWEEKDQVTEMAPKKTSKIASLIPKGSKTTAAKKESLI PSSS 

,^^^ S ^ P ^ QTISreSTAS ^ SEKFRTCKGSPSQSLSKPI ^ I ^ ASS CPAPLEGREAGQASPSGSCTMT 

VAQSSGQSTGNGAVQLPQQQQHSHPNTATVAPFIYRAHSENEGTALPSADSCTSPTKMDLSYSKTAKOCLEEISG 

EGPETIUlMRTVKNIADLRQNLEETMSSLRGTQISHSTLETTFDSTVTTEVNGRTIPNLTSRPTPMTWRIiGOACPR 
^° AG ^ APS ^ AGYPRSGTSRFIHT DPSRFMYTTPLR^ 

ILGKSLRTDDINSGYMTDGGLNLYTRS1JTOIPDTATSRDIIQRGVHDVTVDADSWDDSSSVSSGLSDTLDNISTD 
DLI^SSVSSYSNITVPSRKNTQLRTDSEKRSTTDETWDSPEELKKPEEDFDSHGDAGGKWKTVSSGLPEDPEKA 
SLSVSQTGSWRRGMSA Q GGAPSR Q KAGT SALKTPGKTDDAKASEKGKAPLKGSSLQRSPSDAGKSSGDEGK 



5CTMT 600 
ilEISG 675 
>ACPR 750 
ISDGD 825 
900 
975 
X050 
1125 



" 1 TLQ " X kSL * k ^*SSTSGI PGRGGHRSSTSSIDSNVSSKSAGATTSKLREPTKIGSGRSSPVTVNQTDKE 1200 
KEKVAVSDSESVSLSGSPKSSPTSASACGA0GLRQPGSKYPDIASPTFRRlfgakaggksasapntegvksssvml275 
pspsttlarqgslespssgtgsitigsagglsgsssplfnkpsdlttdvislshslasspasvhsftsgglvwaanm 1350 
f S ^^ dtpsyqsmtslhtssesidl P ls ^ sls ^ttgthevqsllmrtgsvrstlse S mqldrntlpkkg 1425 
eHT*®?* Q ^ QETC ^ 1500 
SIPAQDSSFDLYDDSQLCGSATSLEERPRAISHSGSFRDSMEEVHGSSLSLVSSTSSLYSTAEEKAHSEQIHKLR 1575 
RELVASQEKVATLTSQLSANAHLVAAFEKSLGNMTGRLQSLTMTAEQKESELIELRETIEMLKAQNSAAOAAIOG 1650 
AI^GPDHPPKDLRIRRQHSSESVSSINSATSHSSIGSGNDAD^^ 1725 

f^f*?^^^ 1800 
LQLKSELREKELKLTDIRLEALSSAHHLDQIREAMNRMQNEIEILKAENDRLKAETGNTAKPTRPPSESSSSTSS 1875 
SSSRQSLGLSLNNI^ITEAVSSDILLDDAGDATGHKIX3RSVKIIVSISKGYGRAKDQKSQAYLIGSIGVSGKTKW 1950 
£YL^Y IRRLFKEYWRIOTS ^ 2025 
o5fY^ TLIPKPITQRY ^^ HHRIILSGPSGTGKTYL ^^ 2100 
QYLANLAEQCSADNNGVELPWI ILDNLHHVGSLSDIFNGFLNCKYNKCPYIIGTMNQGVSSSPNLELHHNFRWV 2175 
LCANHTEPVKGFLGRYLRRKLIEIEIERNIRNNDLVKIIDWIPKTWHHLNSFLETHSSSDVTIGPRLFLPCPMDV 2250 
EGSRVWFMDLWNYSLVPYILEAVREGLQMYGKRTPWEDPSKWVLDTYPWSSATLPQESPALLQLRPEDVGYESCT 2325 
STKEATTSKHIPQTDTEGDPLMNMLMKLQEAANYSSTQSCDSESTSHHEDILDSSLESTL 2385 

Regions corresponding to heterozygous sequences encoding presence or absence of 
this "gion are in lower case letters. These regions are from 1326 to 1413 ; 
from 1414 to 1427 ; from 1703 to 1709 and from 1768 to 1788. 

Putative start methionines at positions 1 and 51 are indicated in lower case. 

For the variant mentioned in figure le, the amino acid sequence from position 1 
to 81 has to be replaced by the following amino acid sequence : 

mDLSSEmNRHGKNPVSHKLEDQKK 24 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9963080A1_I_> 



• PCT/EP99/03848 

Piaure la Nucleotide sequence of a 4984 bp fragment from BAC 585E09 (contains 

paS of ?he ge^Sc sequence of Hs-unc-53/1) extending the sequence derived from 
cDNA libraries shown in figure la. 



1500 
1575 
1650 
1725 

CGAGCTGCTCTCCAGCAAGG^C^G^ "?,° 

^^s^^ssssss^sssssss^sss^s^ss, lilt 

2475 



EESS^ 2550 



CAGGCGGGAAGGGAGGAACAAAGCTTGCTCGAG' 



1TGGAGGAAGCGCGCAGAGCTGTTCCATTGTTCTCCGTGCCTG 



1AAGCGGGCCCCGGAACTGCTCTTTCTCTCCCCGGAGAGCCCCTGCCCTCAGA 



=SSSSScccnm« = 3750 



3525 
3600 



Sa^SccS^^ 4125 

SSSSgg^ 4350 

GC^gISg^G^^^ 4425 

SSccto^gLgggcagagggtagactgcc^ J*™ 

G^CCAGAGCCAATCATGGTGGTGTTCAC^TATCAGACAGGCCCTCAGTGTACAGC 4575 
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TGTCCAGTTATGTTCACTCCATAGTACCATCCTAGATCCAAGAGGCTGCCAAGAATCAATTTCTGAGGCGGAGGG 4725 

AGGGGGTGGGAGTGAGGCAGCTTCAAGTCAGAGCCTTTCTGTAATAAGAGGGAAGGACTGAAACCTGATCATCCC 4800 

i???? 30 *^^ 4875 

AATAACTAGGGCCCCACTGGGGAACCCTAGCAACTTGGAAGACTGAGGAGTGAGTACCGAGGGCAAATGGGCTAA 4950 
TTCCAGGAATTAGATGCCTCTGGACCCTGGCCCG *™ 

The sequence shown in figure la starts at position 1246. Upstream in the same 
reading frame as used for the translation of the DNA sequence in fig la into the 
protein sequence of fig lb, a stop codon is found at position 815. A first 
putative start codon (ATG) can be found at position 1124. Assuming this star^ 
codon, the protein sequence from fig lb is extended by the sequence 
MLGSSVKSVQPEVELSSGGGDEGADEPRGAGRKAAAADGRG 

Intronic sequence has been found to start at position 1881. 
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Figure lh 
Nagase et al 



Illustration of a 5'-deletion variant of Hs-unc-53/3 discovered by 
(1999, DNA Res. 6:63-70). 



^vtaa0938 Torotem, amino atxu o<= M »*^w — 

DSMEEVHGSSLSLVSSTSSLYSTMbRAna v D7 „ TI! Mi.vAn NSAAOAAIOGALNGPD 



LVAAFEKSiejmT^U=^»-~v"--- SGNDADSKKKKKKNWV NSRGSELRSSFKQ 
HPPKDLRIRRQHSSE^SSI^AT^ 



AFGKKKSTKPPSSHSDIEELTD: 



RQSLGLSLNNLNITEAVSSDILLDDAGDATGHKDGR 



TGOT"AKPTRPPSESSSSTSSSSS^^-~-_ kti ^ REYVFRIDTSTS 
SVXIIVSISKGYGRAKDQK^AYLIGSIG^^^ 



>AB023155 cDNA nucleotide sequence 

actgtcattg aattgtactg cattagaaag 
tcttgtcacc tttagttggc ctttttcaat 
aattattttt tatagttcag agaaccattt 
tacattatgt ccttaggggt tttctttgtt 
tacccaaaaa gggactaaga tataccccat 
aagagtggtt gcgttctcat tctactggag 
tggtttcccc ttctgccatg tcatcttctg 
tgagcccaac aaatttgtct caatttaacc 
gcatcccagc ccaagactct tccttcgatc 
ccacttctct ggaggaaaga cctcgtgcca 
tggaagaagt tcatggctct tcattatcac 
cagctgaaga aaaggctcat tcagagcaaa 
cacaagaaaa agttgctacc ctcacatctc 
cttttgaaaa gagcttaggg aatatgactg 
aacaaaagga atctgaactt atagaactaa 
attctgctgc ccaggcggct attcagggag 
atcttcgcat cagaagacag cattcctctg 
gccattccag tattggcagt ggtaatgatg 
gggtgaactc tagaggaagt gagctgagaa 
agtccaccaa gcctccttca tcacattctg 
cggcatcccc caagttaccc cataatgctg 
cacaatctgc ttcagcgatc tgtgaatgca 
tgaagagcga gctcagagaa aaggaattaa 
gctctgctca tcatcttgat cagatccggg 
aaatactgaa agctgaaaat gaccggttga 
ctcggccacc gtcagaatcc tcaagcagca 
gactttctct aaacaatttg aacatcacag 
atgctggcga tgcaaccgga cataaagatg 
gcaagggcta tggtcgagca aaggaccaaa 
gtgttagtgg aaaaaccaag tgggatgtct 
aatatgtatt ccgaattgat acatccacta 
gctactgtat aggagactta attagatccc 
gtggatacct tgttggagat aataacatca 
atagtttgga cagttttgtt tttgatacgc 
ttaacttgtt gatggagcat cacagaatta 
cctatttggc aaacaaactt gctgaatatg 
aggatgcaat tgccactttt aatgtggacc 
tagctaacct ggctgaacag tgcagtgctg 
taattcttga taatctccat catgtgggct 
attgtaaata caacaaatgt ccatatatta 
caccaaatct agagctgcat cacaatttca 
cagtgaaagg ctttttaggc agatatcttc 
ggaacattcg caataatgac ctagtcaaaa 



ctatcactaa 
gcaatggaca 
gttaccaaaa 
aagttttgtt 
agaaatacac 
gaagagggca 
cagtcacctc 
tctaacttgg 
cgctcaaaca 
tgtgggagtg 
agagacagca 
ctttactcta 
ctggttgcat 
cttgtagcag 
atgacagcgg 
aaggctcaga 
cctcccaaag 
agtgccacaa 
aagaaaaact 
gggaagaaaa 
tcatcccttc 
atgaagccct 
attctgcagc 
gaggccctca 
aatgaaattg 
gctaagccta 
cagtcattag 
ttgctagatg 
gtctccataa 
ggatccattg 
ctctttaagg 
tgcattgcta 
ttgctgcctt 
gtagaagaaa 
caaaggtact 
actggaaaga 
aaaaaaacag 
caacaatatc 
ccagttgtaa 
ggttttctca 
gtttcttcat 
catacagaac 
gaaattgaaa 



gaactcaaat 
gagttaagca 
ttgttggatg 
ttaacagcat 
catctcggca 
ggcttcagga 
cagctggaaa 
ttcccgggcc 
tctatgatga 
tcagtcattc 
tggtgtccag 
tccataaact 
agctttcagc 
gccgattgca 
gagaaaccat 
cactgaatgg 
aaagtgtttc 
ccgactccaa 
gttctttcaa 
acattgaaga 
gtgactgtgg 
cagaagctga 
aattaacgga 
aagccatgaa 
aggcagaaac 
cctcctcttc 
aggctgttag 
gccgcagtgt 
aatctcaggc 
tagatggtgt 
gccttggtct 
ataacctaga 
tcactgtgaa 
tgattcctaa 
tactctcagg 
taataaccaa 
acaagtcaag 
ataataatgg 
ctctgagtga 
ttggaacaat 
ggtgggtatt 
gaagaaaact 
ttatagattg 



atgtgtgacg 
ttatatgtgt 
tgtaatttgg 
gcagcttgac 
ggccaaccaa 
cactggcaac 
ataccacttt 
cagcatgatg 
ctcccagctt 
gggctcattc 
cacttcttct 
gcggagagag 
aaatgctcac 
aagtctaact 
tgaaatgctg 
tccagaccat 
tagtatcaac 
gaagaagaaa 
acaagccttt 
gcttactgat 
ctcagcatcc 
ggcagagata 
tattcggctg 
ccggatgcag 
tggtaacaca 
atcttccagg 
ctcagatatt 
gaaaattata 
atatttgata 
aataagacgt 
gagctctgac 
agtgcctgaa 
cctcaaaggg 
accaattacc 
accgagtggt 
atctggaagg 
taaggaattg 
agtggagctc 
tatcttcaat 
gaatcaggga 
atgtgcaaat 
catagagata 
gattccgaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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acgtggcatc atctcaacag ttttttggaa acacacagtt cttctgacgt taccattggt 
ccccgactat tccttccttg ccccatggat gtagaaggtt ctagagtatg gttcatggat 
ctctggaact attctttagt accttatatt ctggaggcag tgagagaggg tcttcagatg 
tatgggaaac gcacaccatg ggaagatcct tcaaagtggg tgcttgacac atatccatgg 
agctcagcaa ctctgcctca ggagagccca gccttacttc agctgcgacc agaagatgtt 
gggtatgaaa gctgcacatc cactaaggaa gccacaacct caaagcacat tccgcaaact 
gacacagaag gagatcccct gatgaatatg ctaatgaaac tccaagaagc agccaattac 
tcgagcacac aaagctgcga cagcgaaagc accagccacc atgaagacat tttggattca 
tctctcgaac ctaccctcta gagggtgaaa aaagttaagg gaaaagactt tgcttttaaa 
aaaatgtttc aaaagaaagg tattttcact aaaccactgc cagtataaaa gcaccctgtc 
aagggccctg acccagagtt gtggtctcca aggaggcagc agaactaagt ctgaaccgcc 
aagatgctaa attgcaatgg aagcttaact ttagtttatt tctaagcatt ttttatatct 
gtggagtaat agaaagctcc attactcaac tggaaaggac cctaatgaca gggcaactga 
acagattgca catgggatag ccaaactgga ctttctttgt ttcctcttta aaagtttaca 
atgcagacca ttttttgtcc cttccttttg tttcctctga ggggctgttc gccccaggca 
gggtccatct ttctgatctg tccaacctcc tttgtgccac acggtgctgg tcacagggct 
tcagtagtgt ttgtgttgtg cgctcacccc attccagaac aaatccaaga ggccagtcct 
ccataagcac aaatggaatt gtgcaaccac cagaaaaaca ctactgtggc aaactggaga 
agtgccaatt taattctaac tgccacgttc tcatgatgtg ctccaccaac tttttagtat 
atgagtcact ggttttataa ggttgttttt accacagtgg tctttttaaa ccacctgccc 
actcccttaa caagagtttt ataccaatta ttagtcaaca ctgataaaag gcttttttag 
ggctttattt gtttgagcct tttcagtgaa agaaggaaca tttcctatgg tgctgtctca 
ctgccttaaa acagatttct atgacagttt aacagttggt ttaaatccta aaccattggt 
aatttccact gtcttttcat ttacaaccaa gcaacaccag ttaacatagt agcctcatct 
ctatatatct ttctcttttt tttttttttt tgaagaaatg gataggagaa agatcagtat 
ttttagcctt gtgaatagat cgctttgcct atcctccaaa atattaaaat aacccagaaa 
tgctctttga ccgtcactta aaacctaaga catgtggcga aattccatcc agttctaagt 
gaaagagttt cagaaggcag gagattttga attattatcc agcagggctg gaagcactag 
atgcagcatg agcacaacta ttcggctttc cttccctatt gtttttgttt ttttaatgag 
ttttgacgca tgttgttttg attgctattg ttgtacatga gaaattcagc attaaagaac 
actgaagcgg taaggtcact gtggaagagg aagcgtttat actgtaaaag aaggttagat 
ttgcacagtc tactgggtag gtattgtaaa taataatttt taaaacttgc acaaatcaaa 
acaaacacaa acaaaattgt attttatcct attggtgtta agaggtgttt cacttgctga 
gatttcctgt acattgcaaa caaatacaga atgcaaaccc tcaaagctgt attatctggt 
gtgtttgtcc tgtatttaca gttgtttttg actatgcagg agctatcagt gctagagtga 
gcatgcttca aaactgtaca tgaagccaat atatttttgg ataagtaaaa ctgtctgaaa 
gtacatctgt catggcaggc tttaaagaga gtgcatgaaa actgatcagt cattggagaa 
gttaccacca cacacaaagg acaggtttta agtttatgaa acccaagggc taggccatgg 
tatagacttc ttctatgagt gtgtgaaaat gtgttacttt taggacgtgt atttggtgct 
actctctgtg accaccaatg ggtcagttgc tatagaacaa caacaccacg aaacatctgt 
gcagttttca gagtgtcaca aagtcaatag gtccttacac ggtgctattg ccctaaggga 
aatccgaact gaatttatgc acatagaatt gtcaccctga ctttgaagcc tcaaacatgg 
atcaaatctg ttgtgaaaca tcaatatatg tagctggatg agtgactagt ttcccttgta 
taatatgtga tctaagaaaa ttgctaatct ttccctgcca ttttgagaaa cacagtccaa 
acatgagcat aaacagaatt tcctgcaata catcccagta ggtccaccta gtttacaact 
taaactagtt tgtgaaacat ttgtctgtat acattttata ttttgtacat tttgatgtaa 
catatcatgt aaataggcag aaacagtgaa ataaatcatc tgaaaagttt tgtagtcttt 
gtaaagcccc aacaataagt acttggtgtc aatggactta actggatgat gtattttcta 
ttggcttatt gttcctctag cttgtaaacc agcttgcata tatttttttg caaatgtgca 
ccctgtatct gtctaaatta ttactttgcc attaaagtgg aattatttat tgac 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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Figur* 2: Illustration of a multiple sequence alignment between the different 
members of the Unc-53 protein family. 
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MTTSNVELIP . IYTDWANRHLSKGSLSKS IRDI3NDFRDYRLVSQLINV 
61 TCEFGEKKPLQGKAKEKECSK . IYTDWA17HVLAKSGHKJU,IKDLQQDIAIXj^ItAEI IQI 
4 VSESSQQQKRKPVIHGLEDQKRIYTrWANHYLTK^^ 
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Hs-unc-53/3 
Hs-unc-53/2 
Hs-unc-53/1 



49 I VP I NEFS PAFTKRLAX ITSKLDGL ETCLD YLKNLGLDCSKLTKTD IDSGNLGAVI. 
120 IA. . NEKV^INGCPRSCSCMIEXWUVTTL^ 
64 VA. . NEKI EDINGC PKNRSQMI £371 DACLNFLAAKGINXQGL SAEEIRNGNLKAIL 
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Hs-unc-53/3 
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Hs-unc-53/1 



105 QILFLLSTYKQKLRQLKKDQKKLEOLPTSIMPPAVSKLPSPRVATSATASAT 

174 Gt-FFSLSRYKQ. . * CQHHQQQ « . YYQSI.VELQQRVTH . A3PPSEASQAXTQQOM0S C SLAA 
113 GLFFSLSRYKQ . . . QQQQPQK . , QHIiS . SPLPFAVSCVASAPSQCQAGTPQQQVPV.TPQA 
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157 . .NPNSNFP . QMSTSRLQTPQ SRISKIDS . . SKIGIKPKTSGLKPPSSSTTSSNNT .NSF 

22 S . . RYATQSNH SGXATSQKKPT 3 RLPGP . . SRVPAAGSSSKVQGA 5KL. .NRRSQSF 

17 2 PCQPHQPA PHQQ SKAQAEMQS RL3GP . TARVS AAG SEAXTRGG STTANNRRSQSF 
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211 R P5SR SSGNNHVGSTISTSA.KSLE3SSTYSSISNX2IR. * 

2 77 NSIDKNK. . . . FFNYANGNEKDS . SXGPQSSSG . . VNCNVQPPSTAGQ PPAS 

2 26 NNYDXSKFVTSPPPPPSSHEXEPIiASSASSHPG. .KSDNAPASLESGSS . STPTOCSTSS 
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32 2 AT?S? . SAS . KPWRSKSMNVKHEATSTMLTVKQS STATS FTPS 3 . . . DRLKP . PVSE6VX 
283 AIPQPOAAT.KPWRSKSLSVKHSATVSML3VIC PPGPEA. ..PR FTPEAMK 
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32 9 PAPNNQK. . . . SMLEKLXLFNSKGGSKAGEGPGSRDTSCERX-ETLPSFEESEELEAASRM 
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476 VCTEK . PVKEEKDQ VTEMAPXXTSKIASLI PKG SKTTAAKKESLI P 

440 LAKRASVTERLDLKEEPKEDPSG . . . AAVFEM. PXKSSKIASFIPKGGKLNSAKKEPMAP 
1 . .MLPKRAXAPCCGGGMAFASAAELKVFXSGSVDSRVPGGPPASNLRKQKSLT 

3 81 A? 1 1 SQQDSKRC SKS SEEESG YAGFNSTSPT SS STEGSLM . H 5TS SKS S 

S23 SSSGIPKFGSKVPTVKOTISPGSTASXESEKFRTTKGSPSQSLSKP . IT . MEXASASSCP 
496 SHSGIPKPGMKSMPCKSPSAP. , APSXEGERSRSGKLSSGLPQQXPQLDG .RKSSS5S5L 
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lH iS^SpGG" TtLnHSISSQTVSQSVGTTQTTGSNTVSVQLP. • .QPQO0YKHPOTAT 

476 VXGVKS7AKKDPPPAV. . PPRDTQPTIG . .V. VSPIMAHKKLTNDPVISEK. . . . .?EPE 
630 VAPFI VRAHSENEGTALPSADSCT . S . P . . TKMDL . . S . Y5KTAKQCLEEISGE . . . GPE 
III VAPFLYRSQTDTEGNV . .TAESSS.T G . - VSVEP . .SK7IKTGQPAiEELTGE . 
160 DSLLS SKAKAQKSSGPVPSAXGQE . E . RAFLKVDP . 
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Ce-unc-53 714 SRSSTSVDSRSRAEQENVYKLLSOCRTSQRGAAATSTPCQHSLRSPG. . . . YSSYS 

Hs-unc-53/3 1299 GSAGGLSGSSSPLFNKPSDLTTDVTSLSHSLAS SPASVHSFTSGCLVWAANMSMS 
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Cc-unc-53 1151 EVRLDNL2RAREVDVLRETVNKLKTENKQLKKEVDKL . . . TNGP . .ATRASSRAS 1^ 

Hs-^c-S3/2 laOO DIRLEAZSSAKQIjDQLREAMNRMCSEIEKLKAENDRLKSESQGSG . C ^^£^Xf" 1 ' ^ 
Ss-^.c-53/1 1310 DIRLEALNSAHQLDQLRETMKNMQLEVDLLKAENDRLKVAPGPSSGST . . PGQVPGsSAI. 



Hs-unc-53 3 1923 . .K*QA . VLIG IGVS ^.cv^yv^pxEviXHVDPVSOLG-JIS.OSVl^SI 
!;:™:Si!l "« ::gS:JrSSS' ) .GKVr 3 W.XM iD£ AVF 3 VFKDVI S KHDP A ST LG LSr.ESIKGVSI 

S e "^"«/ 3 gdlS ^Se^ellpc^vgdnniitvnlkgveensldspvfdtlipkpitqry 

Hs-unc~53/3 1S87 ^^• • •^- Tp E^pcsyL^ENTriS^GLAENSLDSLVFESLIPKPILCRY 

S:^:S/? 1473 S55: : '• '.^Seppemppcrr. .gvnn.i^kglkekcvdslvfetlipkpmmqky 

^ VTCS1LTERJU-VIAGATGIGKSKLAKTLAAYVSIRTNOS -EDSIV . NISIPENNKEELLQ 

C.-u«c-53 1364 ^f^^2^SGTGKTVXANnAEYrcWS«Wa^^ 
hs-unc-53 /3 2042 ™£^^*^g TGK ^^ 

,,,, vFPPLrK-^RSKESC IVILDNIPKNRIAFWSVFANVPLON . - .NEGPFWCTVK 

S ire"- 3 2 1^83 Y^™VO^ 

Hs-une-53/1 1591 YLSNLANQIDRETGIGDVPLVIIXDDl, . . SEAGSISEZ.V . NG&L . 7CK.YHKCPYIIQTTW 

e-j t YOT?E^GIPHNFKMSVMSNRI.E. . . GFXLRYLRRRAVEDEYRLTVQMPSEwFKr I 

S"^-"/2 2139 QATSSTP^OLHHNFRVT^CAiraTEPVKGrLCRFLRRKlJlETEISGRVPJW . EkVRII 
Ha-u^.c-^l 1647 S^NHBWl^FWILTrsi™ 

Ce-unc-53 1526 DFFPI At QAVNNF IEKTNSVDVTVGPRACLNC PLTVDG SRSWFI ^''''J*^^* j[^?!lY^ 
Hs-Sc-sl/3 2215 DWI FXTWHHLNSFLSTHS S SDVTIGPRLFLPCPMDVEG SRVWF^ffiLWOTSLVPYILEAV 

X-Jnt-w" 1704 ^P^HLHTFLEK«STSDFLICPCFFL S CPIGIEDFRTWFIDLWKN S IIPVX.QEGA 

Ce-unc-S3 1585 RDGKKTFGRCTSFEDFTDIVSKKWPWFDGENPEK . . . -^JSSS^^^SJS! 

Hs-unc-53/3 2274 REGLQMYGKRTPWEDPS)CWVLDTY.FW . - SSATLPQESPALLQliRPEDVG^ESCISTXEAT 

K5-^C-53 '2 2255 REGLQIVGRRAPWEDPAXWVMDTYPW . . AASPQQHEWPPLLQLRPEDVGFDGYSMPREGS 

S-^c-"/? "« roGI^HGOKAAWEDPVEWVRSTLFW . .PSA. . QQDQSKLYHLFPPTVGPHSIASPPEDR 

Ce-UftC-53 163 5 SSRQ HFNPL . ESLICL . KATKH . . . QTIDMI 

Hs-unc-53/3 2332 TSKHIPCTC^EGDPLffloeMKLQEAANYSSTQSCDSE . . STSHKEDILDSSLESTL 

Hs-^c-53/2 2313 TSXQMPPSDAEGDPLKNMIJIRXQEJUlNYSSPQSYDSDSNSNSHHDDILOSSLESTL 

Hs-unC-53/1 1819 TVKDSTPSSLDSDPLKAMLLiaQEAANY . . IESPDRET ILDPNLQATL 
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Fiffuxe 3: Illustration of a auiicipie 5e3uer.cs (lUanwe.nt becwoar. C. elegans Unc-5J 
:C®) and C. Brigg 3 ae Unc-53 (CbJ . 

Cfc 1 OTTSNVELIPlYTCT^lRJiLSKCy^SRPII^ 
" e 1 ^ Str ^ IPI1fT I>WANRKLSKGS:LSKSIRDISND^^ 

Cb 8 1 ^G^CSJa.TX7DJDS&^GAVLQIAFLLSSYXQKLR«:j^^ 

C* 81 KNLGU3CSKLTKTD I DSGNLGAVLC^LrLL STYKQKLP.QUOCDC fC<L£0-F TSIMPPAVSKLf SPRVATSATASA 

Cbl 6 X SNPNSNFTQMST SRLOTPQSRISKPSSTKI CIKPXTTSGLRPP . STTS SMTNINSFRPSSES55NNNVOSTI STSARSLD 

" fe ' i56 ^^^P0**STSPJL£TPQSRI3KISSSIU.GIKPX.rSGIi^^ 

Cfc240 SSSAYSSISNL5RFTPSS025KPTSRLQTC«VRVATTC^ SGML 

Ce23 5 SSSTYSSISJCJWT. . SSLQKF. SRPCTCLVTlVATTTKIGl-SKirAPKAVSTPICiASVJn'I . GAKQEPDNSGGGGCGKL 

Cb316 KLKLFSSKKASSSNNSPQP1.RKA EU . . ^KIAAP^G^KPPTSfH'^G^.r^KLCTPfO/SVRKPiyri^JlTK^ 

Ce3il lO.KLFSSIQgPSSSSNSPQPTRKAAAVPQaQTLSKIAAS'VXSC-iAPPTS . . . KLCSATSMSH-CTPKVSYHKTnAPIISCQ 

Cb3 89 DSKRCSKSSEE2SGYACTKSTSPASSSTEGSI-SraSTSSX£S7SD£K£PSSDD^TL:a£IV*rA13CPIAT^ VSP VI'K 
C*338 DSKRCS^SEEESCYACFNSTEPTSSSTEGSLSMHSTS^^ 

Cb4£3 WEEKPTLAVKft\ SASIOTPPKVrER^ 

Ce463 PV^rPTiAV^GVKSTAXXDPPFAVPPRDTCPTI.7VVSPlMAinua-rNDPV:SEK. . PEP EKLQSMS TDVTDVyPL f . PL 

Cb546 K£^VPPI^pr« lC pp ? YDT.XV-K;GKl-rsPVKSFGVI)C\'BffSASEE£TVAH . . '/JMAPPVQJTTSAGQSSMERRIQBC^KT 
C*54:> KSV. -VPLRMTSIHCPPTYDVLLKCC-KITSPVXSFGYEQ. . SSASEES IVAHASACVTPPr . Kl'S . CNKSIiERRMGjCJKT 

Cb624 SE5SGYA5EAGVAliCAKMR£KIiKEYrD>frRRAQNGYPDNFXD5SSLSSGIJ^^ 
Ca619 SESSCYTSI^7AMCAJMREKLKEY£D>rrRRAONGYPI^ 

Cb?0 4 KTVRHTSSS5SRPRVPSP.F STSVDSS.5SVE0ENVYXLI.5CCRTSCRGAATAT3SFS<3HSl-RSPGYSSy SPHXTV3ADKDT 
Ce699 HFVRPJ>TSSS5XPBVPSR5ST£VCSR3R^0ENVYKLr.SQCRTSQRGAA . ATFTFG-gKSI.REPGy££ySPHI.SV5AOKDT 

Cb784 M^SQTSIU^SS3K?SYACCFHSL::RXCHLQEF7SAEH^^ 

C*7? 8 MS>«S3TSRRPSSSXEr/SGQFHS;L!:PJCCH13^ SSCSYSARSRCCSSTGIYSET 

Cb86* FQl^RI.SDErjSF;^SARSEKSSC:.S:A£TTAYCT^ 
Cc85 / PCt.KPXS0EXSPAHSAKSEKC=QL£;.A£TTAYG5I^YI^ 

Ce937 -TQHIDRSNlJtfEXAZKFRCDIAm^i;^^ 

061017 KNKicWXRS SLSKFTKXKNXNYDEAHMPSI SGSQGTIJ3NII^^E1^2EiJCERDSALVXVRI,D^CJ)RASEVD^RSlVl^Kl. 

Sill? "^^"^fj^^^^^ 

C«12 3 7 nOGWWMWVDSLVU^LPTO^ 
C^337 S£5£5E!!^^ 

Le 1.4 9 < I ?^ERVAIU:cXi:TFGKCTSFS=PTDr7SKJ<WPWEGENPI3^ICRLCLCDI.vT SPANSSRQHFNP^SLXOLHATK 
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Figure 4. Prosite Signaturtss 
JLEXCO.UT 

Block C, Vert«brat«: 

RX<S,T)PlL,M><S,T) WRXGQ ( S , A > XFRLQAGDAPS 

Block 13, Vertfibrata: mi „ fV * 

<ra .«>x.««.i.v.i»<«.t.v.i»u.o...t,««., a . 1 .i<».»>'».o...*.o<».-» 

ss 

Block F, Vertebrate: 
DRKTLPKXGLRY 

Bioe* o. Large family: c Tv (JV „ e T)S(0 2 ) XY ( A . G . S . T > XX ( E , N ) E ( K » 

GSXCI.L,V.A)SL<I,L,V.A)S<A ; G . S T > ( A v, . x ^J x J££i J . LXXXTXX 
R) X ( 4 . 5 ) I ( R . H ) X ( L . M > XR ( D ; E ) LXXXXXXVXX- •JJ"****^ # Q , H ^ XXXX ( A . S ) XA (H . 

i^ifxixX^^ 

Block G , Vertebrate: 

SGSFROXXCCE) (E.D>VHGSXLSLtV.A>SS|T.A>SSXY^^ 

JI(R.H)XLRRELX(A ;S>S0SXVX<T ; A>LTJT o < f X £^^ V J^XxiXGX ( L , I ) N ( A . 
I)MTXRL(Q.R)XI.XXTAE(a. S > XXXEI.XXLRX . I \ D .EIWM* \ . DQS)s(I(V)SS (i,L> 
O.S.I)X(N.O.Q.E)XXXKXlC. 6) (N . D . Q , E > LR. < K . R > RQXS S < N . D . W , - » 

NSXTSHSSXGS 

Block H. Large Saraiiy: 

( V , L > DSXVX ( D . E J XL ( X . L) PXX { M . L . V. . I > XXXX XXX ( - . I 
(A.S)GX(T,S)GXGX(T. S ) XL ( A . T) XXLXXY (M . L . -J . I > XX , R . K } 

and 

PCE.N)XX(I.L)HXXF(K.R)XXX(A.S)NXXEX(0,3)GF(L.I)XP.(Y.F)L(K.R)(K.R) <K,Ri 
X(M.L.V,I) (D.E) 

L»EXXX(T. S )X,=. E ,XXXGPXXX ! L.I)XCP(M. L ,V,1,X(V I)(O.E)XX(R,X)XWFXXL 
WNXXXU VlPYa'llXXXIA.VHR.KI ( D , E ) CXXXXGXX ( T . A ) X ( F , Y . W ) EDP 

Bloc* H, Vertebra-a: 

(V.LIMXVFCD.El C T . S ) ^IPKP (M , L , V , I ) XQXYXXLL ( M . L . V , I , XHXR ( I , L ) (I.VIL3GP5 
GTGKTYL I A , T ) NRLXE Y ' M , Z* . V , I ) XX ( R , K ) GR 
and 

LHXXFRXXXX(A.3)NXXEP(A.V)XGFLXR(Y.FJHK,R) ( K . R ! ( K . R ) L < M . L . V . 1 
&r d 

(R.K) (V.I) i L.;)DWXPKXWXa(I.LjXXFLEXHS(r.S:SDXXXGPXXFLXCP(M.L.V,I)X(V < I 
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Figure 7b: Illustration of the plasmid *]£ or 
nucleotide sequence of the P GI3303 «~J"^f ° r ( ° 
terminal Hs-unc-53/3 fragment m fusion wxth GFP) 
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FT 

SQ 



PGI3303 
CDS 



CDS 



CDS 



CDS 



promoter 



circular DNA; 8140 BP 

1225. .2019 
/vntiikey="4" 
/label=Kan/Neo 
3942. .4658 
/vntifkey:^" 
/label=eGFP 
4719. .8102 
/vntifkey^M" 
/label=Hs-unc-53/3 

4659. .4718 
/vntifkey="4" 
/label^linker 
3330. .3918 
/vntifkey="29" 
/label=Pcmv 
SEQUENCE 8140 BP; 

CTAGATAACT GATCATAATC AGCCATACCA CATTTGTAGA 
ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 
TGTTTATTGC AG CTT AT AAT GGTTACAAAT AAAGCAATAG 
AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 
GCGTAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT 
TCATTTTTTA ACCAATAGGC CGAAATCGGC AAAATCCCTT 
GAGATAGGGT TGAGTGTTGT TCCAGTTTGG AACAAGAGTC 
TCCAACGTCA AAGGGCGAAA AACCGTCTAT CAGGGCGATG 
CCCTAATCAA GTTTTTTGGG GTCGAGGTGC CGTAAAGCAC 
AGCCCCCGAT TTAGAGCTTG ACGGGGAAAG CCGGCGAACG 
AAAG CG AAAG GAGCGGGCGC TAGGGCGCTG GCAAGTGTAG 
ACCACACCCG C CGCGCTT AA TGCGCCGCTA CAGGGCGCGT 
AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC 
ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA 



GGTTTTACTT 
TGCAATTGTT 
CATCACAAAT 
ACTCATCAAT 
TAAATTTTTG 
AT AAAT C AAA 
CACTATTAAA 
GCCCACTACG 
TAAATCGGAA 
TGGCGAGAAA 
CGGTCACGCT 
CAGGTGGCAC 
ATTCAAATAT 
AAAGGAAGAG 



GCTTTAAAAA 

GTTGTTAACT 

TTCACAAATA 

GTATCTTAAC 

TTAAATCAGC 

AGAATAGACC 

GAACGTGGAC 

TGAACCATCA 

CCCTAAAGGG 

GGAAGGGAAG 

GCGCGTAACC 

TTTTCGGGGA 

GTATCCGCTC 

TCCTGAGGCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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GAAAGAACCA 

CAGGCAGAAG 

CAGGCTCCCC 

TCCCGCCCCT 

CCCATGGCTG 

TATTCCAGAA 

GAGACAGGAT 

GCCGCTTGGG 

GATGCCGCCG 

CTGTCCGGTG 

ACGGGCGTTC 

CTATTGGGCG 

GTATCCATCA 

TTCGACCACC 

GTCGATCAGG 

AGGCTCAAGG 

TTGCCGAATA 

GGTGTGGCGG 

GGCGGCGAAT 

CGCATCGCCT 

TGACCGACCA 

ATGAAAGGTT 

GGGATCTCAT 

AGACAATACC 

GTGTTGGGTC 

CCCCACCGAG 

CCCCAAGTTC 

TAGCCTCAGG 

GGATCTAGGT 

CGTTCCACTG 

TTCTGCGCGT 

TGCCGGATCA 

TACCAAATAC 

CACCGCCTAC 

AGTCGTG TCT 

GCTGAACGGG 

GATACCTACA 

GGTATCCGGT 

ACGCCTGGTA 

TGTGATGCTC 

GGTTCCTGGC 

CTGTGGATAA 

TTAGTTCATA 

GGCTGACCGC 

ACGCCAATAG 

TTGGCAGTAC 

AAATGGCCCG 

TACATCTACG 

GGGCGTGGAT 

GGGAGTTTGT 

CCATTGACGC 

TTAGTGAACC 

AGCTGTTCAC 

AGTTCAGCGT 

TCATCTGCAC 

ACGGCGTGCA 

CCGCCATGCC 

ACAAGACCCG 

AGGGCATCGA 

ACAGCCACAA 

AGATCCGCCA 

CCCCCATCGG 

CCCTGAGCAA 

CCGCCGGGAT 

AAGCTTCGAA 

CCTCACCCAC 

CTAATACTGA 

GGCAAGGCAG 

TAAGCGGCAG 

GCTTAAGTCA 

TCGTGTGGGC 

AGTCCATGAC 

GCTCCTTGTC 

GTAGTGTGAG 

AGGGACTAAG 

TGCGTTCTCA 



GCTGTGGAAT 

TATGCAAAGC 

AGCAGGCAGA 

AACTCCGCCC 

ACTAATTTTT 

GTAGTGAGGA 

GAGGATCGTT 

TGGAGAGGCT 

TGTTCCGGCT 

CCCTGAATGA 

CTTGCGCAGC 

AAGTGCCGGG 

TGGCTGATGC 

AAGCGAAACA 

ATGATCTGGA 

CGAGCATGCC 

TCATGGTGGA 

ACCGCTATCA 

GGGCTGACCG 

TCTATCGCCT 

AGCGACGCCC 

GGGCTTCGGA 

GCTGGAGTTC 

GGAAGGAACC 

GTTTGTTCAT 

ACCCCATTGG 

GGGTGAAGGC 

TTACTCATAT 

GAAGATCCTT 

AGCGTCAGAC 

AATCTGCTGC 

AGAGCTACCA 

TGTCCTTCTA 

ATACCTCGCT 

TACCGGGTTG 

GGGTTCGTGC 

GCGTGAGCTA 

AAGCGGCAGG 

TCTTTATAGT 

GTCAGGGGGG 

CTTTTGCTGG 

CCGTATTACC 

GCCCATATAT 

CCAACGACCC 

GGACTTTCCA 

ATCAAGTGTA 

CCTGGCATTA 

TATTAGTCAT 

AGCGGTTTGA 

TTTGGCACCA 

AAATGGGCGG 

GTCAGATCCG 

CGGGGTGGTG 

GTCCGGCGAG 

CACCGGCAAG 

GTGCTTCAGC 

CGAAGGCTAC 

CGCCGAGGTG 

CTTCAAGGAG 

CGTCTATATC 

CAACATCGAG 

CGACGGCCCC 

AGACCCCAAC 

CACTCTCGGC 

TTCTGCAGTC 

ATTTCGAAGG 

GGGTGTGAAA 

TCTGGAGTCA 

CAGCAGCCCT 

CTCGTTGGCC 

TGCCAATATG 

TAGCCTCCAC 

TGGACTGACC 

ATCTACTCTC 

ATATACCCCA 

TTCTACTGGA 



GTGTGTCAGT 

ATGCATCTCA 

AGTATGCAAA 

ATCCCGCCCC 

TTTATTTATG 

GGCTTTTTTG 

TCGCATGATT 

ATTCGGCTAT 

GTCAGCGCAG 

ACTGCAAGAC 

TGTGCTCGAC 

GCAGGATCTC 

AATGCGGCGG 

TCGCATCGAG 

CGAAGAGCAT 

CGACGGCGAG 

AAATGGCCGC 

GGACATAGCG 

CTTCCTCGTG 

TCTTGACGAG 

AACCTGCCAT 

ATCGTTTTCC 

TTCGCCCACC 

CG CGCTATG A 

AAACGCGGGG 

GGCCAATACG 

CCAGGGCTCG 

ATACTTTAGA 

TTTGATAATC 

CCCGTAGAAA 

TTGCAAACAA 

ACTCTTTTTC 

GTGTAGCCGT 

CTGCTAATCC 

GACTCAAGAC 

ACACAGCCCA 

TGAGAAAGCG 

GTCGGAACAG 

CCTGTCGGGT 

CGGAGCCTAT 

CCTTTTGCTC 

GCCATGCATT 

GGAGTTCCGC 

CCGCCCATTG 

TTGACGTCAA 

TCATATGCCA 

TGCCCAGTAC 

CGCTATTACC 

CTCACGGGGA 

AAATCAACGG 

TAGGCGTGTA 

CTAGCGCTAC 

CCCATCCTGG 

GGCGAGGGCG 

CTGCCCGTGC 

CGCTACCCCG 

GTCCAGGAGC 

AAGTTCGAGG 

GACGGCAACA 

ATGGCCGACA 

GACGGCAGCG 

GTGCTGCTGC 

GAGAAGCGCG 

ATGGACGAGC 

GACGGTACCG 

TTGTTTGGTG 

TCTTCCTCAG 

CCGTCGTCCG 

CTCTTCAATA 

TCCAGCCCAG 

AGCAGTTCCT 

ACGAGCTCTG 

ACAGGCACTC 

TCAGAAAGCA 

TCATCTCGGC 

GGGCTTCAGG 



TAGGGTGTGG 
ATTAGTCAGC 
GCATGCATCT 
TAACTCCGCC 
CAGAGGCCGA 
GAGGCCTAGG 
GAACAAGATG 
GACTGGGCAC 
GGGCGCCCGG 
GAGGCAGCGC 
GTTGTCACTG 
CTGTCATCTC 
CTGCATACGC 
CGAGCACGTA 
CAGGGGCTCG 
GATCTCGTCG 
TTTTCTGGAT 
TTGGCTACCC 
CTTTACGGTA 
TTCTTCTGAG 
CACGAGATTT 
GGGACGCCGG 
CTAGGGGGAG 
CGGCAATAAA 
TTCGGTCCCA 
CCCGCGTTTC 
CAGCCAACGT 
TTGATTTAAA 
TCATGACCAA 
AGATCAAAGG 
AAAAACCACC 
CGAAGGTAAC 
AGTTAGGCCA 
TGTTACCAGT 
GATAGTTACC 
GCTTGGAGCG 
CCACGCTTCC 
GAGAGCGCAC 
TTCGCCACCT 
GGAAAAACGC 
ACATGTTCTT 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGAGT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CGGTCGCCAC 
TCGAGCTGGA 
ATGCCACCTA 
CCTGGCCCAC 
ACCACATGAA 
GCACCATCTT 
GCGACACCCT 
TCCTGGGGCA 
AGCAGAAGAA 
TGCAGCTCGC 
CCGACAACCA 
ATCACATGGT 
TGTACAAGTC 
CGGGCCCGGG 
CCAAGGCAGG 
TAATGCCCAG 
GTACGGGCAG 
AACCCTCAGA 
CATCGGTTCA 
CTGCAGGCAG 
AGTCCATTGA 
ACGAGGTCCA 
TGCAGCTTGA 
AGGCCAACCA 
ACACTGGCAA 



AAAGTCCCCA 
AACCAGGTGT 
CAATTAGTCA 
CAGTTCCGCC 
GGCCGCCTCG 
CTTTTGCAAA 
GATTGCACGC 
AACAGACAAT 
TTCTTTTTGT 
GGCTATCGTG 
AAGCGGGAAG 
ACCTTGCTCC 
TTGATCCGGC 
CTCGGATGGA 
CGCCAGCCGA 
TGACCCATGG 
TCATCGACTG 
GTGATATTGC 
TCGCCGCTCC 
CGGGACTCTG 
CGATTCCACC 
CTGGATGATC 
GCTAACTGAA 
AAGACAGAAT 
GGGCTGGCAC 
TTCCTTTTCC 
CGGGGCGGCA 
ACTTCATTTT 
AATCCCTTAA 
ATCTTCTTGA 
GCTACCAGCG 
TGGCTTCAGC 
CCACTTCAAG 
GGCTGCTGCC 
GGATAAGGCG 
AACGACCTAC 
CGAAGGGAGA 
GAGGGAGCTT 
CTGACTTGAG 
CAGCAACGCG 
TCCTGCGTTA 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
CATGGTGAGC 
CGGCGACGTA 
CGGCAAGCTG 
CCTCGTGACC 
GCAGCACGAC 
CTTCAAGGAC 
GGTGAACCGC 
CAAGCTGGAG 
CGGCATCAAG 
CGACCACTAC 
CTACCTGAGC 
CCTGCTGGAG 
CGGACTCAGA 
ATCCAAGTAT 
TGGCAAATCT 
CCCTAGTACC 
CATGGGCAGT 
CTTAACTACA 
CTCTTTCACA 
CAAGGATACT 
CCTCCCCCTC 
GAGCCTGCTC 
CAGAAATACA 
AGAAGAGGGC 
CCAGTCACCT 



GGCTCCCCAG 
GGAAAGTCCC 
GCAACCATAG 
CATTCTCCGC 
GCCTCTGAGC 
GATCGATCAA 
AGGTTCTCCG 
CGGCTGCTCT 
CAAGACCGAC 
GCTGGCCACG 
GGACTGGCTG 
TGCCGAGAAA 
TACCTGCCCA 
AGCCGGTCTT 
ACTGTTCGCC 
CGATGCCTGC 
TGGCCGGCTG 
TGAAGAGCTT 
CGATTCGCAG 
GGGTTCGAAA 
GCCGCCTTCT 
CTCCAGCGCG 
ACACGGAAGG 
AAAACGCACG 
TCTGTCGATA 
CCACCCCACC 
GGCCCTGCCA 
TAATTTAAAA 
CGTGAGTTTT 
GATCCTTTTT 
GTGGTTTGTT 
AGAGCGCAGA 
AACTCTGTAG 
AGTGGCGATA 
CAGCGGTCGG 
ACCGAACTGA 
AAGGCGGACA 
CCAGGGGGAA 
CGTCGATTTT 
GCCTTTTTAC 
TCCCCTGATT 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTGGT 
AAGGGCGAGG 
AACGGCCACA 
ACCCTGAAGT 
ACCCTGACCT 
TTCTTCAAGT 
GACGGCAACT 
ATCGAGCTGA 
TACAACTACA 
GTGAACTTCA 
CAGCAGAACA 
ACCCAGTCCG 
TTCGTGACCG 
TCTCGAGCTC 
CCAGATATTG 
GCCTCTGCAC 
ACATTAGCGC 
GCTGGTGGGC 
GATGTTATAA 
TCAGGTGGTC 
CCGAGCTACC 
AGCCATCATG 
ATGAGAACGG 
CTACCCAAAA 
AAAGAGTGGT 
CTGGTTTCCC 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
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CTTCTGCCAT 
CAAATTTGTC 
CCCAAGACTC 
TGGAGGAAAG 
TTCATGGCTC 
AAAAGGCTCA 
AAGTTGCTAC 
AGAGCTTAGG 
AATCTGAACT 
CCCAGGCGGC 
TCAGAAGACA 
GTATTGGCAG 
CTAGAGGAAG 
AGCCTCCTTC 
CCAAGTTACC 
CTTCAGCGAT 
AGCTCAGAGA 
ATCATCTTGA 
AAGCTGAAAA 
CGTCAGAATC 
TAAACAATTT 
ATGCAACTGG 
ATGGTCGAGC 
GAAAAACCAA 
TCCGAATTGA 
TAGGAGACTT 
TTGTTGGAGA 
ACAGTTTTGT 
TGATGGAGCA 
CAAACAAACT 
TTGCCACTTT 
TGGCTGAACA 
ATAATCTTCA 
ACAACAAATG 
TAGAGCTGCA 
GCTTTTTAGG 
GCAATAATGA 
GTTTTTTGGA 
GCCCCATGGA 
TACCTTATAT 
GGGAAGATCC 
AGGAGAGCCC 
CCACTAAGGA 
TGATGAATAT 
ACAGCGAAAG 
AGAGGGTGAA 



GTCATCTTCT 
TCAATTTAAC 
TTCCTTCGAT 
ACCTCGTGCC 
TTCATTATCA 
TTCAGAGCAA 
CCTCACATCT 
GAATATGACT 
TATAGAACTA 
TATTCAGGGA 
GCATTCCTCT 
TGGTAATGAT 
TGAGCTGAGA 
ATCACATTCT 
CCATAATGCT 
CTGTGAATGC 
AAAGGAATTA 
TCAGATCCGG 
TGACCGGTTG 
CTCAAGCAGC 
GAACATCACA 
ACATAAAGAT 
AAAGGACCAA 
GTGGGATGTC 
TACATCCACT 
AATTAGATCC 
TAATAACATC 
TTTTGATACG 
TCACAGAATT 
TGCTGAATAT 
TAATGTGGAC 
GTGCAGTGCT 
TCATGTGGGC 
TCCATATATT 
TCACAATTTC 
CAGATATCTT 
CCTAGTCAAA 
AACACACAGT 
TGTAGAAGGT 
TCTGGAGGCA 
TTCAAAGTGG 
AGCCTTACTT 
AGCCACAACC 
GCTAATGAAA 
CACCAGCCAC 
AGCCGAAATC 



GCAGCTGGAA 
CTTCCCGGGC 
CTCTATGATG 
ATCAGTCATT 
CTGGTGTCCA 
ATCCATAAAC 
CAGCTTTCAG 
GGCCGATTGC 
AGAGAAACCA 
GCACTGAATG 
GAAAGTGTTT 
GCCGACTCCA 
AGTTCTTTCA 
GACATTGAAG 
GGTGACTGTG 
ACAGAAGCTG 
AAATTAACGG 
GAAGCCATGA 
AAGGCAGAAA 
ACCTCCTCTT 
GAGGCTGTTA 
GGCCGCAGTG 
AAATCTCAGG 
TTAGATGGTG 
AGCCTTGGTC 
CATAACCTAG 
ATCACTGTGA 
CTGATTCCTA 
ATACTCTCAG 
GTAATAACCA 
CACAAGTCAA 
GATAATAATG 
TCTCTGAGTG 
ATTGGAACAA 
AGGTGGGTAT 
CGAAGAAAAC 
ATTATAGATT 
TCTTCTGACG 
TCTAGAGTAT 
GTGAGAGAGG 
GTGCTTGACA 
CAGCTGCGAC 
TCAAAGCACA 
CTCCAAGAAG 
CATGAAGACA 
CAGCACACTG 



AATACCACTT 
CCAGCATGAT 
ACTCCCAGCT 
CGGGCTCATT 
GCACTTCTTC 
TGCGGAGAGA 
CAAATGCTCA 
AAAGTCTAAC 
TTGAAATGCT 
GTCCAGACCA 
CTAGTATCAA 
AGAAGAAGAA 
AACAAGCCTT 
AGCTTACTGA 
GCTCAGCATC 
AGGCAGAGAT 
ATATTCGGCT 
ACCGGATGCA 
CTGGTAACAC 
CATCTTCCAG 
GCTCAGATAT 
TGAAAATTAT 
CATATTTGAT 
TAATAAGACG 
TGAGCTCTGA 
AAGTGCCTGA 
ACCTCAAAGG 
AAC CAATT AC 
GACCGAGTGG 
AATCTGGAAG 
GTAAGGAATT 
GAGTGGAGCT 
ATATCTTCAA 
TGAATCAGGG 
TATGTGCAAA 
TCATAGAGAT 
GGATTCCGAA 
TTACCATTGG 
GGTTCATGGA 
GTCTTCAGAT 
CATATCCATG 
CAGAAGATGT 
TTCCGCAAAC 
CAGCCAATTA 
TTTTGGATTC 
GCGGCCGTTA 



TTCTAACTTG 
GCGCTCAAAC 
TTGTGGGAGT 
CAGAGACAGC 
TCTTTACTCT 
GCTGGTTGCA 
CCTTGTAGCA 
TATGACAGCG 
GAAGGCTCAG 
TCCTCCCAAA 
CAGTGCCACA 
AAAGAAAAAC 
TGGGAAGAAA 
TTCATCCCTT 
CATGAAGCCC 
AATTCTGCAG 
GGAGGCCCTC 
GAATGAAATT 
AGCTAAGCCT 
GCAGTCATTA 
TTTGCTAGAT 
AGTCTCCATA 
AGGCTCCATT 
TCTCTTTAAG 
CTGCATTGCT 
ATTGCTGCCT 
GGTAGAAGAA 
CCAAAGGTAC 
TACTGGAAAG 
GAAAAAAACA 
GCAACAATAT 
CCCAGTTGTA 
TGGTTTTCTC 
AGTTTCTTCA 
TCATACAGAA 
AGAAATTGAA 
GACGTGGCAT 
TCCCCGACTA 
TCTCTGGAAC 
GTATGGGAAA 
GAGCTCAGCA 
TGGGTATGAA 
TGACACAGAA 
CTCAAGCACA 
ATCTCTTGAA 



GTGAGCCCAA 
AGCATCCCAG 
GCCACTTCTC 
ATGGAAGAAG 
ACAGCTGAAG 
TCACAAGAAA 
GCTTTTGAAA 
GAACAAAAGG 
AATTCTGCTG 
GATCTTCGCA 
AGCCATTCCA 
TGGGTGAACT 
AAGTCCACCA 
CCGGCATCCC 
TCACAATCTG 
CTGAAGAGCG 
AGCTCTGCTC 
GAAATACTGA 
ACTCGGCCAC 
GGACTTTCTC 
GATGCTGGTG 
AGCAAGGGCT 
GGTGTTAGTG 
GAATATGTAT 
AGCTACTGTA 
TGTGGATACC 
AATAGTTTGG 
TTTAACTTGT 
ACCTATTTGG 
GAGGATGCAA 
CTAGCTAACC 
ATAATTCTTG 
AATTGTAAAT 
TCACCAAATC 
CCAGTGAAAG 
AGGAACATTC 
CATCTCAACA 
TTCCTTCCTT 
TATTCTTTAG 
CGCACACCAT 
ACTCTGCCTC 
AGCTGCACAT 
GGAGATCCCC 
CAAAGCTGCG 
TCTACCCTCT 



5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8140 



Legend: P GI3303 was obtained by inserting the 3421 bp 
BaLl/Spel fragment of the Hs-Unc53/3G L d22 PCR^l D02 in 
a BamHI/Xbal opened pEGFPcl vector (Clontech Inc ) Tins 
plasmid encodes an eGFP protein in fusion with the C- 
?erminal half of Hs-unc-53/3 (last 1128 AA) . Arrows 
indicate the ORFs . 
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Figure 7c: Illustration of the AA sequence of GFP: :C- 
terminal Hs-unc-53/3 fragment ( insert of pGI3303) 

MVSKGEELFTGWPILVELIX5DWGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGV 
QCFSRYPDHMKQHDFFKSAMPEGWQERTIFFKDIXSNYKT^ 
GHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNI^ 
LSKDPNEKRDHM\^LEFVTAAGITLGMDELYK S 

AKAGGKSASAPNTEGVKSSSVMPSPSTTIAROGSLESPSSGTGSMGSAGGL^jSSSPLFNKPSDLTTW 
ISLSHSLASSPASVHSFTSGGLVWAANMSSSSAGSKDTPSYOSMTSLHTSSESIDLPLSHHGSLSGLTT 
GTHEVOSLIiMRTGSWSTLSESMOLDRNTLPKKGLRYTPSSROANOEEGKEWLRSHSTGGLODTGNOSP 
LVSPSAMSSSAAGKYHFSNLVSPTNLSOFNIjPGPSMMRSNSIPAODSSFDLYDDSOLCGSATSLEERPR 

aishsgsfrdsmeevhgsslsiivsstsslystaeekahseoihklrrelvasoekvatltsoiisanahl 
vaafekslgnmtgriiosltmtaeokeselieiiretiemlkaonsaaoaaiogalngpdhppkdlrirro 
hssesvssinsatshssigsgndadskkkkkknwvnsrgselrssfkoafgkkkstkppsshsdieelt 
dsslpaspklphnagix:gsasmkpsosasaicecteaeaeiilolkselrekelkltdirlealssahh 
ldoireamnrmoneieilkaendrlkaetgntakptrppsesssstsssssroslglslnnlniteavs 
sdillddagdatghktcrsvkiivsiskgygrakdoksoayligsigvsgktkwdvldgvirrlfkeyv 
fridtstslglssdciasycigdlirshnlevpellpcgylvgdnn^ 

pkpitoryfnllmehhriilsgpsgtgktylanklaeyvitksgrkktedaiatfnvdhksskelooyl 

ANIiAEOC S ADNNGVEL PWI ILDNLHHVGSLSDIFNGFLNCKYNKCPYI IGTMNOGVSSSPNLELHHNF 
RVA/LCANHTEPVKGFI/jRYLRRKIjIEIEIERNIRNNDLVKIIDWIPKTWHHLNSFLETHSSSDVTIGPR 

lflpcpmdvegsrvwfmdlw^slvpyileavreglomygkrtpwedpskwvldtypwssatlpoespa 
llolrpedvgyesctstkeattskhipotdtegdplmnmlmkloeaanysstoscdsestshhedilds 

SLESTL 

Legend: Single underlined AA sequence represents eGFP. 
Double underlined AA sequence represents the C-terminal 
fragment of Hs-unc-53/3 
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Figure 7d: Illustration of the plasmid map and the 
nucleotide sequence of the pGl3305 expression vector 
(full length Hs-unc-53/3 in fusion with GFP) 



Sac II (1) 




Xho I (4689.) 



ID PGI3305 circular DNA; 11842 BP 

FT CDS 1245.. 2039 

FT /vnti£key="4 rt 

FT /label=Kan/Neo 

FT CD s 3895.. 10983 

FT /vntifkey^A" 

P T /iabel=hHs-unc-53/3\ (fullMength) 

FT CDS 3962.. 4678 

FT /vntifkey=M M 

FT /label=eGFP 

FT promoter 3350.. 3938 

FX /vntifkey="29 n 

FT /label=Pcmv 

SQ SEQUENCE 11842 BP; 

GGGCCCGGGA TCCACCGGAT CTAGATAACT GATCATAATC AGCCATACCA CATTTGTAGA 60 

GGTTTTACTT GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 120 

TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 180 

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 240 

ACTCATCAAT GTATCTTAAC GCGTAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT 300 

TAAATTTTTG TTAAATCAGC TCATTTTTTA ACCAATAGGC CGAAATCGGC AAAATCCCTT 3 60 

ATAAATCAAA AGAATAGACC GAGATAGGGT TGAGTGTTGT TCCAGTTTGG AACAAGAGTC 420 

CACTATTAAA GAACGTGGAC TCCAACGTCA AAGGGCGAAA AACCGTCTAT CAGGGCGATG 480 

GCCCACTACG TGAACCATCA CCCTAATCAA GTTTTTTGGG GTCGAGGTGC CGTAAAGCAC 540 

TAAATCGGAA CCCTAAAGGG AGCCCCCGAT TTAGAGCTTG ACGGGGAAAG CCGGCGAACG 600 

TGGCGAGAAA GGAAGGGAAG AAAGCGAAAG GAGCGGGCGC TAGGGCGCTG GCAAGTGTAG 660 

CGGTCACGCT GCGCGTAACC ACCACACCCG CCGCGCTTAA TGCGCCGCTA CAGGGCGCGT 7 20 

CAGGTGGCAC TTTTCGCGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC 7 80 

ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA 840 

AAAGGAAGAG TCCTGAGGCG GAAAGAACCA GCTGTGGAAT GTGTGTCAGT TAGGGTGTGG 900 

AAAGTCCCCA GGCTCCCCAG CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC 960 

AACCAGGTGT GG AAAGTCCC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA GCATGCATCT 1020 

CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 1080 

CAGTTCCGCC CATTCTCCGC CCCATGGCTG AC T AATTTTT TTTATTTATG CAGAGGCCGA 1140 
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GGCCGCCTCG 
CTTTTGCAAA 
GATTGCACGC 
AACAGACAAT 
TTCTTTTTGT 
GGCTATCGTG 
AAGCGGGAAG 
ACCTTGCTCC 
TTGATCCGGC 
CTCGGATGGA 
CGCCAGCCGA 
TGACCCATGG 
TCATCGACTG 
GTGATATTGC 
TCGCCGCTCC 
CGGGACTCTG 
CGATTCCACC 
CTGGATGATC 
GCTAACTGAA 
AAGACAGAAT 
GGGCTGGCAC 
TTCCTTTTCC 
CGGGGCGGCA 
ACTTCATTTT 
AATCCCTTAA 
ATCTTCTTGA 



GCCTCTGAGC 
GATCGATCAA 
AGGTTCTCCG 
CGGCTGCTCT 
CAAGACCGAC 
GCTGGCCACG 
GGACTGGCTG 
TGCCGAGAAA 
TACCTGCCCA 
AGCCGGTCTT 
ACTGTTCGCC 
CGATGCCTGC 
TGGCCGGCTG 
TGAAGAGCTT 
CGATTCGCAG 
GGGTTCGAAA 
GCCGCCTTCT 
CTCCAGCGCG 
ACACGGAAGG 
AAAACGCACG 
TCTGTCGATA 
CCACCCCACC 
GGCCCTGCCA 
TAATTTAAAA 
CGTGAGTTTT 
GATCCTTTTT 



TATTCCAGAA 
GAGACAGGAT 
GCCGCTTGGG 
GATGCCGCCG 
CTGTCCGGTG 
ACGGGCGTTC 
CTATTGGGCG 
GTATCCATCA 
TTCGACCACC 
GTCGATCAGG 
AGGCTCAAGG 
TTGCCGAATA 
GGTGTGGCGG 
GGCGGCGAAT 
CGCATCGCCT 
TGACCGACCA 
ATGAAAGGTT 
GGGATCTCAT 
AGACAATACC 
GTGTTGGGTC 
CCCCACCGAG 
CCCCAAGTTC 
TAGCCTCAGG 
GGATCTAGGT 
CGTTCCACTG 
TTCTGCGCGT 



TGGCTTCAGC 
CCACTTCAAG 
GGCTGCTGCC 
GGATAAGGCG 
AACGACCTAC 
CGAAGGGAGA 
GAGGGAGCTT 
CTGACTTGAG 
CAGCAACGCG 
TCCTGCGTTA 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
CATGGTGAGC 
CGGCGACGTA 
CGGCAAGCTG 
CCTCGTGACC 
GCAGCACGAC 
CTTCAAGGAC 
GGTGAACCGC 
CAAGCTGGAG 
CGGCATCAAG 
CGACCACTAC 
CTACCTGAGC 
CCTGCTGGAG 
CTCAGATCTC 
TGGGTCAAAG 
GCACTGTTCT 
GCTTGCGTTA 
GGAGAAAGAA 
CCACAAGCGG 
AATCATCCAG 
GTCTCAGATG 
TGTTCAAGGT 
GTTTTTCAGT 
CTTGGTGGAA 
CAAAACCCAG 
CAGTGGAATT 
TGCTGCAGGA 
CTTTAACAGC 
CTCCAAAGGA 
TGGGCAGCCT 



v> i. Ail Gil 

AGAGCGCAGA 
AACTCTGTAG 
AGTGGCGATA 
CAGCGGTCGG 
ACCGAACTGA 
AAGGCGGACA 
CCAGGGGGAA 
CGTCGATTTT 
GCCTTTTTAC 
TCCCCTGATT 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTGGT 
AAGGGCGAGG 
AACGGCCACA 
ACCCTGAAGT 
ACCCTGACCT 
TTCTTCAAGT 
GACGGCAACT 
ATCGAGCTGA 
TACAACTACA 
GTGAACTTCA 
CAGCAGAACA 
ACCCAGTCCG 
TTCGTGACCG 
GAGCATATGC 
CCTGTGCATA 
TCAAGACCTT 
AAATCAACCT 
GACAGCAAGA 
CTGATCAAGG 
ATTATTGCAA 
ATTGAAAATG 
CTATC7GCTG 
TTATCTCGCT 
CTTCAGCAGC 
CAAGATATGC 
GCAACCAGTC 
AGCAGCAGCA 
ATTGACAAAA 
CCTCAATCGT 
CCTGCCTCTG 



TGCC*j«jAtCA 
TACCAAATAC 
CACCGCCTAC 
AGTCGTGTCT 
GCTGAACGGG 
GATACCTACA 
GGTATCCGGT 
ACGCCTGGTA 
TGTGATGCTC 
GGTTCCTGGC 
CTGTGGATAA 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TTAGTGAACC 
AGCTGTTCAC 
AGTTCAGCGT 
TCATCTGCAC 
ACGGCGTGCA 
CCGCCATGCC 
ACAAGACCCG 
AGGGCATCGA 
ACAGCCACAA 
AGATCCGCCA 
CCCCCATCGG 
CCCTGAGCAA 
CCGCCGGGAT 
CTGTTCTTGG 
CTGCTCTTCC 
TGGAACTTGC 
GTGAATTTGG 
TTTACACTGA 
ACTTGCAACA 
ATGAAAAAGT 
TTGATGTCTG 
AAGAAATAAG 
ACAAGCAGCA 
GAGTTACTCA 
AGTCCAGTCT 
AAAAAAAGCC 
AGGTCCAGGG 
ACAAGCCTCC 
CTTCAGGTGT 
CCATCCCTTC 



GTAGTGAGGA 
GAGGATCGTT 
TGGAGAGGCT 
TGTTCCGGCT 
CCCTGAATGA 
CTTGCGCAGC 
AAGTG CCGGG 
TGGCTGATGC 
AAGCGAAACA 
ATGATCTGGA 
CGAGCATGCC 
TCATGGTGGA 
ACCGCTATCA 
GGGCTGACCG 
TCTATCGCCT 
AGCGACGCCC 
GGGCTTCGGA 
GCTGGAGTTC 
GGAAGGAACC 
GTTTGTTCAT 
AC CCCATTGG 
GGGTGAAGGC 
TTACTCATAT 
GAAGATCCTT 
AGCGTCAGAC 
AATCTGCTGC 
AGAGCTACCA 
TGTCCTTCTA 
ATACCTCGCT 
TACCGGGTTG 
GGGTTCGTGC 
GCGTGAGCTA 
AAGCGGCAGG 
TCTTTATAGT 
GTCAGGGGGG 
CTTTTGCTGG 
CCGTATTACC 
G CCCATAT AT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GTCAGATCCG 
CGGGGTGGTG 
GTCCGGCGAG 
CACCGGCAAG 
GTGCTTCAGC 
CGAAGGCTAC 
CGCCGAGGTG 
CTTCAAGGAG 
CGTCTATATC 
CAACATCGAG 
CGACGGCCCC 
AGACCCCAAC 
CACTCTCGGC 
GGTTGCCTCA 
GATACCAAAT 
TGAAACAGAG 
AGAGAAGAAA 
CTGGGCCAAC 
AGACATTGCA 
TGAAGATATC 
CCTTAGTTTT 
AAATGGAAAC 
ACAACACCAT 
CGCTTCCCCT 
GGCAGCCAGA 
TACTAGGCTT 
AGCCTCTAAT 
AAATTATGCA 
AAATGGTAAC 
TCCAAGTGCC 



GGCTTTTTTG 
TCGCATGATT 
ATTCGGCTAT 
GTCAGCGCAG 
ACTGCAAGAC 
TGTGCTCGAC 
GCAGGATCTC 
AATGCGGCGG 
TCGCATCGAG 
CGAAGAGCAT 
CGACGGCGAG 
AAATGGCCGC 
GGACATAGCG 
CTTCCTCGTG 
TCTTGACGAG 
AACCTGCCAT 
ATCGTTTTCC 
TTCGCCCACC 
CGCGCTATGA 
AAACGCGGGG 
GGCCAATACG 
CCAGGGCTCG 
ATACTTTAGA 
TTTGATAATC 
C CCGTAG AAA 
TTGCAAACAA 
ACTCTTTTTC 
GTGTAGCCGT 
CTGCTAATCC 
GACTCAAGAC 
ACACAGCCCA 
TGAGAAAGCG 
GTCGGAACAG 
CCTGTCGGGT 
CGGAGCCTAT 
CCTTTTGCTC 
GCCATGCATT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTAGCGCTAC 
CCCATCCTGG 
GGCGAGGGCG 
CTGCCCGTGC 
CGCTACCCCG 
GTCCAGGAGC 
AAGTTCGAGG 
GACGGCAACA 
ATGGCCGACA 
GACGGCAGCG 
GTGCTGCTGC 
GAGAAGCGCG 
ATGGACGAGC 
AAACTGAGGC 
CTTGGCACTA 
AGCTCCATGC 
CCCCTCCAAG 
CACTACCTAG 
GATGGAGTAC 
AATGGATGTC 
CTAGCAGCCA 
TTAAAAGCCA 
CAACAACAGT 
CCATCGGAAG 
TATGCAACTC 
CCAGGGCCCT 
TTAAATAGGA 
AATGGAAACG 
GTGCAGCCTC 
AGCAAGCCCT 



GAGGCCTAGG 
GAACAAGATG 
GACTGGGCAC 
GGGCGCCCGG 
GAGGCAGCGC 
GTTGTCACTG 
CTGTCATCTC 
CTGCATACGC 
CGAGCACGTA 
CAGGGGCTCG 
GATCTCGTCG 
TTTTCTGGAT 
TTGGCTACCC 
CTTTACGGTA 
TTCTTCTGAG 
CACGAGATTT 
GGGACGCCGG 
CTAGGGGGAG 
CGGCAATAAA 
TTCGGTCCCA 
CCCGCGTTTC 
CAGCCAACGT 
TTGATTTAAA 
TCATGACCAA 
AGATCAAAGG 
AAAAACCACC 
CGAAGGTAAC 
AGTTAGGCCA 
TGTTACCAGT 
GATAGTTACC 
GCTTGGAGCG 
CCACGCTTCC 
GAGAGCGCAC 
TTCGCCACCT 
GGAAAAACGC 
ACATGTTCTT 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGAGT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CGGTCGCCAC 
TCGAGCTGGA 
ATGCCACCTA 
CCTGGCCCAC 
ACCACATGAA 
GCACCATCTT 
GCGACACCCT 
TCCTGGGGCA 
AGCAGAAGAA 
TGCAGCTCGC 
CCGACAACCA 
ATCACATGGT 
TGTACAAGTA 
AGCCAGCTGT 
CTGGGTCACA 
TTTCTTGTCA 
GAAAAGCCAA 
CAAAATCAGG 
TCCTAGCAGA 
CTAGAAGTCA 
GAGGGGTAAA 
TTCTAGGGCT 
ACTATCAGTC 
CCAGCCAGGC 
AGTCTAATCA 
CTAGGGTGCC 
GAAGTCAGAG 
AAAAAGATTC 
CCAGTACTGC 
GGCGCAGCAA 
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GTCCATGAAT 
AGCCACCTCC 
AACTGCTCCC 
TGCTTTACGC 
TGCCTTTTCT 
AACAAATAGC 
TCTCAGCAAT 
AAATAAAGTT 
TCCAAAAAAG 
TAAGAAGGAA 
AACAGTAAAG 
GACTACCAAG 
TGCTTCTAGT 
TTCCTGTACC 
ACTCCCTCAA 
CAGGGCACAT 
TACAAAGATG 
TGAAGACCCT 
TTTAGAAGAG 
AACATTTGAC 
TCGACCCACC 
TGCTCCCTCC 
CCCCTCAAGG 
CATGTCACAG 
CGATGTGGGT 
CATCAACAGT 
AATACCAGAC 
G G ATGCAG AC 
TAACATCAGC 
CGTCCCCTCT 
CGAGACCTGG 
GGATGCTGGT 
AGGGCAGAAA 
CCAAGGAGGG 
AACCGATGAT 
AAGATCTCCT 
CATTGGAAGA 
ATCTGCCATG 
AATTCCAAAA 
CGGTTCACAG 
TCGCAGCTTG 
CAGATCCAGT 
CTCGAAACTG 
CCAAACAGAC 
AGGTTCCCCC 
GCCAGGATCC 
GGCAGGTGGC 
GCCCAGCCCT 
GGGCAGCATG 
CTCAGACTTA 
GGTTCACTCT 
AGGCAGCAAG 
CATTGACCTC 
GGTCCAGAGC 
GCTTGACAGA 
CAACCAAGAA 
TGGCAACCAG 
CCACTTTTCT 
CATGATGCGC 
CCAGCTTTGT 
CTCATTCAGA 
TTCTTCTCTT 
GAGAGAGCTG 
TGCTCACCTT 
TCTAACTATG 
AATGCTGAAG 
AGACCATCCT 
TATCAACAGT 
GAAGAAAAAG 
AGCCTTTGGG 
TACTGATTCA 
AGCATCCATG 
AG AG AT AATT 
TCGGCTGGAG 
- GATGCAGAAT 
TAACACAGCT 



GTCAAACACA 
CCCACACCAT 
TCAGGACAGA 
CCCCCGCAGC 
GAATCTGGTG 
AGTCCCAAAG 
AAAAAGTCTT 
TGCACTGAAA 
ACCTC CAAAA 
AGCTTAATTC 
CAAACCATTT 
GGGAGCCCTT 
TGTCCTGCCC 
ATGACAGTGG 
CAGCAGCAAC 
TCAGAAAATG 
GACTTATCAT 
GAAACAAGAA 
ACTATGTCCA 
AGCACTGTGA 
CCCATGACCT 
CTGGGTGCTG 
TTCATGTATA 
ATTGACATGA 
GGATATATGA 
GGGTACATGA 
ACAGCAACTT 
AGCTGGGATG 
ACTGATGACC 
AGGAAGAATA 
GATAGTCCTG 
GGCAAGTGGA 
GCTTCCCTGT 
GCGCCATCTA 
GCCAAAGCTT 
TCAGATGCAG 
TCGACTGCCA 
ATCACCAGCA 
TCTGCTGCCA 
AATCAGGATG 
CCCCGCCCTT 
ACCAGCAGTA 
AGAGAACCAA 
AAGGAAAAGG 
AAATCCAGCC 
AAGTATCCAG 
AAATCTGCCT 
AGTACCACAT 
GGCAGTGCTG 
ACTACAGATG 
TTCACATCAG 
GATACTCCGA 
CCCCTCAGCC 
CTGCTCATGA 
AATACACTAC 
GAGGGCAAAG 
TCACCTCTGG 
AACTTGGTGA 
TCAAACAGCA 
GGGAGTGCCA 
GACAGCATGG 
TACTCTACAG 
GTTGCATCAC 
GTAGCAGCTT 
ACAGCGGAAC 
GCTCAGAATT 
CCCAAAGATC 
GCCACAAGCC 
AAAAACTGGG 
AAGAAAAAGT 
TCCCTTCCGG 
AAGCCCTCAC 
CTGCAGCTGA 
GCCCTCAGCT 
GAAATTGAAA 
AAGCCTACTC 



GTGCCACCTC 
CTTCAGACAG 
AATCCATGCT 
CTCCCAGTTC 
AAATGGAAGG 
TGTCACCTAA 
TGCTACAGCC 
AACCAGTCAA 
TTGCAAGCTT 
CGTCTTCCAG 
CACCTGGCAG 
CCCAGTCCTT 
CTTTGGAAGG 
CACAAAGCAG 
ATAGCCACCC 
AAGGTACCGC 
ATAGTAAGAC 
GAATGAGAAC 
GTCTTCGTGG 
CAACAGAAGT 
GGAGGTTGGG 
GCTATCCTCG 
CCACGCCTCT 
GTGAGAAAGC 
GTGATGGTGA 
CAGATGGAGG 
CCCGGGACAT 
ACAGCAGTTC 
TGAACACCAC 
CTCAGCTGAG 
AGGAACTGAA 
AGACTGTGTC 
CTGTTTCACA 
GGCAGAAAGC 
CTGAGAAAGG 
G AAAAAG C AG 
CCAGCTCCTT 
GTGGAGCAAC 
TTGGCGGGAA 
ATGTTGTGCT 
CAAAATCCAG 
TTGATTCCAA 
CTAAAATTGG 
AAAAAGTAGC 
CCACCTCTGC 
ATATTGCCTC 
CTGCACCTAA 
TAGCGCGGCA 
GTGGGCTAAG 
TTATAAGCTT 
GTGGTCTCGT 
GCTACCAGTC 
ATCATGGCTC 
GAACGGGTAG 
CCAAAAAGGG 
AGTGGTTGCG 
TTTCCCCTTC 
GCCCAACAAA 
TCCCAGCCCA 
CTTCTCTGGA 
AAGAAGTTCA 
CTGAAGAAAA 
AAGAAAAAGT 
TTG AAAAG AG 
AAAAGGAATC 
CTGCTGCCCA 
TTCGCATCAG 
ATTCCAGTAT 
TGAACTCTAG 
CCACCAAGCC 
CATCCCCCAA 
AATCTGCTTC 
AGAGCGAGCT 
CTGCTCATCA 
TACTGAAAGC 
GGCCACCGTC 



CACCATGTTG 
ACTGAAGCCA 
TGAGAAATTC 
AGGACCTAGT 
TTTTAACAGT 
GTTGGCCCCT 
AAAGGAAAAA 
AGAAGAGAAG 
GATCCCTAAG 
TGGTATTCCA 
CACAGCAAGC 
ATCTAAGCCT 
AAGGGAAGCT 
TGGGCAGAGC 
GAATACCGCG 
TTTACCATCG 
TGCTAAGCAG 
AGTTAAAAAC 
GACTCAGATA 
TAATGGAAGG 
CCAGGCATGT 
CAGTGGTACC 
CCGTCGAGCT 
AAGCAGTGAC 
TATCCTTGGG 
ACTTAACCTA 
CATCCAGAGA 
AGTGAGCAGT 
ATCCTCTGTC 
GACAGATTCA 
AAAACCAGAA 
CTCTGGACTT 
GACAGGTTCC 
TGGAACAAGT 
AAAAGCTCCC 
TGGAGATGAA 
TGGCTTTAAG 
CATAACAAGT 
GTCAAATGCA 
GCATGTTAGC 
CACCAGTGGC 
CGTCAGCAGC 
GTCAGGGCGC 
AGTCTCAGAT 
CAGCGCCTGT 
ACCCACATTT 
TACTGAGGGT 
AGGCAGTCTG 
CGGCAGCAGC 
AAGTCACTCG 
GTGGGCTGCC 
CATGACTAGC 
CTTGTCTGGA 
TGTGAGATCT 
ACTAAGATAT 
TTCTCATTCT 
TGCCATGTCA 
TTTGTCTCAA 
AGACTCTTCC 
GGAAAGACCT 
TGGCTCTTCA 
GGCTCATTCA 
TGCTACCCTC 
CTTAGGGAAT 
TGAACTTATA 
GGCGGCTATT 
AAGACAGCAT 
TGGCAGTGGT 
AGGAAGTGAG 
TCCTTCATCA 
GTTACCCCAT 
AGCGATCTGT 
C AG AG AAAAG 
TCTTGATCAG 
TG AAAATG AC 
AGAATCCTCA 



ACTGTAAAGC 
CCTGTCTCAG 
AAGCTAGTCA 
GATGGTGGGA 
GGTCTGAATA 
CCAAAAG CTG 
GAAGAAAAGA 
GATCAGGTGA 
GGCAGCAAGA 
AAACCAGGCT 
AAAGAGTCTG 
ATAACCATGG 
GGCCAAGCTT 
ACAGGAAATG 
ACAGTGGCAC 
GCTGACTCCT 
TGCCTGGAGG 
ATAGCAGACT 
AGCCACAGCA 
ACCATACCCA 
CCGCGACTTC 
AGTCGATTCA 
GCTGTCTCTA 
CTGGACATGT 
AAAAGTCTCA 
TATACTAGAA 
GGGGTTCACG 
GGTCTCAGTG 
AGCTCTTACT 
GAGAAACGCT 
NAAGATTTTG 
CCTGAAGACC 
TGGAGAAGAG 
GCACTCAAAA 
CTAAAAGGAT 
GGGAAAAAGC 
AAACCAAGTG 
GGCTCTGCAA 
GGGAGAAAAA 
TCAAAGACTA 
ATTCCTGGAC 
AAGTCTGCTG 
TCAAGTCCTG 
TCAGAAAGTG 
GGTGCACAAG 
CGAAGGTTGT 
GTGAAATCTT 
GAGTCACCGT 
AGCCCTCTCT 
TTGGCCTCCA 
AATATGAGCA 
CTCCACACGA 
CTG AC C AC AG 
ACTCTCTCAG 
ACCCCATCAT 
ACTGGAGGGC 
TCTTCTGCAG 
TTTAACCTTC 
TTCGATCTCT 
CGTGCCATCA 
TTATCACTGG 
GAGCAAATCC 
ACATCTCAGC 
ATGACTGGCC 
GAACTAAGAG 
CAGGGAGCAC 
TCCTCTGAAA 
AATGATGCCG 
CTGAGAAGTT 
CATTCTGACA 
AATGCTGGTG 
GAATGCACAG 
GAATTAAAAT 
ATCCGGGAAG 
CGGTTGAAGG 
AGCAGCACCT 



AGTCAAGTAC 
AAGGGGTCAA 
ATGCCCGGAC 
AGGATGATGA 
GTGGTGGCTC 
GAAGCAAAAA 
ACAGGGACAA 
CAGAGATGGC 
CAACAGCAGC 
CTAAAGTTCC 
AGAAATTCAG 
AGAAAGCAAG 
CTCCTTCTGG 
GTGCTGTCCA 
CATTCATTTA 
GTACCAGTCC 
AGATATCTGG 
TGAGGCAGAA 
CCCTGGAGAC 
ACTTGACAAG 
AGGCGGGAGA 
TCCACACAGA 
GGCTGGGAAA 
CTTCTGAGGT 
GGACTGATGA 
GTCTGAACCG 
ATGTGACAGT 
ACACCCTTGA 
CCAACATCAC 
CCACCACAGA 
ACAGCCATGG 
CCGAGAAGGC 
GCATGTCTGC 
CACCCGGGAA 
CATCTCTACA 
CCCCCTCAGG 
GAGTAGGGTC 
CACTGGGTAA 
CCAGTTTGGA 
CCCTACAATA 
GAGGAGGCCA 
GGGCCACCAC 
TCACCGTCAA 
TTTCTTTGTC 
GTCTCAGGCA 
TTGGTGCCAA 
CCTCAGTAAT 
CGTCCGGTAC 
TCAATAAACC 
GCCCAGCATC 
GTTCCTCTGC 
GCTCTGAGTC 
GCACTCACGA 
AAAGCATGCA 
CTCGGCAGGC 
TTCAGGACAC 
CTGGAAAATA 
CCGGGCCCAG 
ATGATGACTC 
GTCATTCGGG 
TGTCCAGCAC 
ATAAACTGCG 
TTTCAGCAAA 
GATTGCAAAG 
AAACCATTGA 
TGAATGGTCC 
GTGTTTCTAG 
ACTCCAAGAA 
CTTTCAAACA 
TTGAAGAGCT 
ACTGTGGCTC 
AAGCTGAGGC 
TAACGGATAT 
CCATGAACCG 
CAGAAACTGG 
CCTCTTCATC 
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TTCCAGGCAG TCATTAGGAC TTTCTCTAAA CAATTTGAAC ATCACAGAGG 
AGATATTTTG CTAGATGATG CTGGTGATGC AACTGGACAT AAAGATGGCC 
AATTATAGTC TCCATAAGCA AGGGCTATGG TCGAGCAAAG GACCAAAAAT 
TTTGATAGGC TCCATTGGTG TTAGTGGAAA AACCAAGTGG GATGTCTTAG 
AAGACGTCTC TTTAAGGAAT ATGTATTCCG AATTGATACA TCCACTAGCC 
CTCTGACTGC ATTGCTAGCT ACTGTATAGG AGACTTAATT AGATCCCATA 
GCCTGAATTG CTGCCTTGTG GATACCTTGT TGGAGATAAT AACATCATCA 
CAAAGGGGTA GAAGAAAATA GTTTGGACAG TTTTGTTTTT GATACGCTGA 
AATTACCCAA AGGTACTTTA ACTTGTTGAT GGAGCATCAC AGAATTATAC 
GAGTGGTACT GGAAAGACCT ATTTGGCAAA CAAACTTGCT GAATATGTAA 
TGGAAGGAAA AAAACAGAGG ATGCAATTGC CACTTTTAAT GTGGACCACA 
GGAATTGCAA CAATATCTAG CTAACCTGGC TGAACAGTGC AGTGCTGATA 
GGAGCTCCCA GTTGTAATAA TTCTTGATAA TCTTCATCAT GTGGGCTCTC 
CTTCAATGGT TTTCTCAATT GTAAATACAA CAAATGTCCA TATATTATTG 
TCAGGGAGTT TCTTCATCAC CAAATCTAGA GCTGCATCAC AATTTCAGGT 
TGCAAATCAT AC AGAAC CAG TGAAAGGCTT TTT AG GCAG A TATCTTCGAA 
AGAGATAGAA ATTGAAAGGA ACATTCGCAA TAATGACCTA GTCAAAATTA 
TCCGAAGACG TGGCATCATC TCAACAGTTT TTTGGAAACA CACAGTTCTT 
CATTGGTCCC CGACTATTCC TTCCTTGCCC CATGGATGTA GAAGGTTCTA 
CATGGATCTC TGGAACTATT CTTTAGTACC TTATATTCTG GAGGCAGTGA 
TCAGATGTAT GGGAAACGCA CACCATGGGA AGATCCTTCA AAGTGGGTGC 
TCCATGGAGC TCAGCAACTC TGCCTCAGGA GAGCCCAGCC TTACTTCAGC 
AGATGTTGGG TATGAAAGCT GCACATCCAC TAAGGAAGCC ACAACCTCAA 
GCAAACTGAC ACAGAAGGAG ATCCCCTGAT GAATATGCTA ATGAAACTCC 
C AATT ACT CA AGCACACAAA GCTGCGACAG CGAAAGCACC AGCCACCATG 
GGATTCATCT CTTGAATCTA CCCTCTAGAG GGTGAAAGCC GAAATCCAGC 
CCGTTACTAG TGGATCGGCC GC 



CTGTTAGCTC 
GCAGTGTGAA 
CTCAGGCATA 
ATGGTGTAAT 
TTGGTCTGAG 
ACCTAGAAGT 
CTGTGAACCT 
TTCCTAAACC 
TCTCAGGACC 
TAACCAAATC 
AGTCAAGTAA 
ATAATGGAGT 
TGAGTGATAT 
GAACAATGAA 
GGGTATTATG 
GAAAACTCAT 
TAGATTGGAT 
CTGACGTTAC 
GAGTATGGTT 
GAGAGGGTCT 
TTGACACATA 
TGCGACCAGA 
AGCACATTCC 
AAGAAGCAGC 
AAGACATTTT 
ACACTGGCGG 
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Legend: pGI3305 was obtained by inserting a 7148 bp 
XhoI/SacII fragment of the Hs-unc-53 /3AlLd22 clone in a 
XhoI/SacII opened pEGFPc3 vector (Clontech Inc.). This 
plasmid encodes an eGFP protein in fusion with the full 
length Hs-unc-53/3 (2363 AA) . Arrows indicate the ORFs . 
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Figure 7e: Illustration of the AA sequence of GFP : : Hs- 
unc-53/3 (insert of pGl3305) 




Legend: Single underlined AA 8e ^ en «'f ""^^"hs- 
Double underlined AA sequence represents full lengtn hs 

unc-53/3. 
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FIG. 8 Illustration of the filopodia and lamellipodia 
outgrowth of N4 mouse neuroblastoma cells transfected 
with pGI3303 . 

A: 




B: 







Legend: Fluorescence images of N4 cells transfected with 
pEGFP (A) compared to pGI3303 transfected cells (B and 
C) / A: control (pEGFP ) transfected cells. B: Illustration 
of filopodia outgrowth (arrowhead). C: Illustration 
lamellipodia outgrowth (arrowhead) . Notice the 
sheets at the edge of the cells. 



of 
actin 
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FIG. 9 Illustration of the co-localization of the GFP- 
Hs-unc-53/3 fusion protein with microtubules in N4 mouse 
neuroblastoma cells transfected with OGI33 05 

A: 




B: 





C: 




Legend: Fluorescence images of N4 cells transfected with 
pEGFP (A) compared to pGI3305 transfected cells (B and 
C) . A: control transfected cells. B: Illustration of co- 
localization of Hs-unc-53/3 with microtubuli . Notice the 
centrosome in the right picture (arrowhead) and enhanced 
filopodia outgrowth in the left picture (arrowhead). C: 
Illustration of the co-localization of Hs-unc-53/3 
with(+)-end of microtubules (arrowhead). 
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Figure 11a: Illustration of the homology between Hs-unc- 
53/3 and a gene encoded (partially) by the Drosophila 
melanogaster BAC clone BACR48M05 (AC005719) . Results of a 
TBLASTN search on the non-redundant database with Hs-unc- 
53/3 as query. 

Query: Hs-unc-53/3 (direct) 2120aa Length 21X9 from:l to = 2119 

Sbjct: gb| AC005719 | AC005719 Drosophila melanogaster, chromosome 2R, region 

38A5-3 8B4, BAC clone 

BACR48M05. complete sequence (Drosophila melanogaster] 
Length = 188357 

Score = 64.0 bits (153), Expect = 4e-08 
Identities = 28/58 (48%), Positives = 41/58 (70%) 

Query: 1 IYTDWANHYLAKSGHKRLIKDLQQDIADGVLLAEI IQI IANEKVEDINGCPRSQSQMI 58 

IYTDWAN+YL ++ KR + DL D DG+LLAE+I + ♦ + KV D+ P++Q QM+ 
Sbjct: 84874 lYTDWANYYLERAKSKRKV^DLSADCRDGLLLAEVIEAVTSFKVPDLVKKPKNQQQW 84701 

Score =39.9 bits (91), Expect = 0.77 

Identities = 22/55 (40%), Positives = 34/55 (61%) 

Query: 48 NGCPRSQSQMIENVDVCLSFLAARGVN-VQGLSAEEIRNGNLKAILGLFFSLSRYK 102 

N C Q +NV> CL L ++ V + + +1 G LKA+L LFF+LSR+K 

Sbjct: 55621 NSCSLFQ FDNVNSCLHVLRSQSVGGLENITTNDICAGRLKAVLALFFALSRFK 55463 

Score - 35.2 bits (79), Expect =3.8 

Identities = 31/72 (43%), Positives = 45/72 (62%) 

Query: 1266 LEERPRAISHSGSFRDSMEEVHGSSLSLVSSTSSLYSTAEEKAHSEQIHKLRRELVASQE 1325 

L+ R + HS S VHGS SL+S SSLY AEE+ + +1 +L+REL +++ 

Sbjct: 13387 LKSRLMQLCHSVSV SVHGSAASLLSGGSSLYGNAEER-QAHEIRRLKRELQDARD 13226 

Query: 132 6 KVATLTSQLSAN 13 37 

+V +L+SQLS N 
Sbjct: 13225 QVXSLSSQLSTN 13190 
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Fimire lib- Illustration of an ORF encoded by the 
DrS^opwia melanosaster BAC clone BACR48M05 (K0057U) as 

prediction by the computer program Fgene. 



Output file 
F469BE1C 
length of 
number of 
positions 
4726 - 
4816 - 
5018 - 
8693 - 
38041 - 
62411 - 
74061 - 
103484 - 
132758 - 
153576 - 
154573 - 
154753 - 
160324 - 
161337 - 
171340 - 
171821 - 
172024 - 
174437 - 
175017 - 
179216 - 
187662 - 



for REVERSE STRAND of FGene 



secjuence 
predicted 
of predici 
4757 w= 
4966 w= 
5318 w= 
8727 w= 
38265 w= 
62522 w= 
74692 w= 
103654 w= 
133134 w= 
153706 w= 
154681 w= 
156246 w= 
160375 w= 
161421 w= 
171756 w= 
171965 w= 
172326 w= 
174810 w= 
175168 w= 
179267 w= 
187678 w= 



- 188357 
exons - 



21 



4.11 
20.57 
15.85 
14.75 
8.43 
10.60 
19.39 
24.14 
17.28 
18.42 
20.72 
23.66 
6.48 
6.82 
10.27 
18.76 
15.53 
9.70 
16.41 
6.89 
5.32 



ORF: 


4726 


- 


4755 


ORF: 


4817 


- 


4966 


ORF: 


5018 




5317 


ORF: 


8695 




8727 


ORF: 


38041 




38265 


ORF: 


62411 




62521 


ORF: 


74063 




74692 


ORF: 


103484 




103654 


ORF: 


132758 




133132 


ORF: 


153577 




153705 


ORF: 


154575 




154679 


ORF: 


154754 




156244 


ORF: 


160325 




160375 


ORF: 


161337 




161420 


ORF: 


171342 




171755 


ORF: 


171823 




171963 


ORF: 


172025 




172324 


ORF: 


174438 




174809 


ORF: 


175019 




175168 


ORF: 


179216 




179266 


ORF: 


187664 




187678 



Length of Coding region- 5367bp 
Amino acid sequence - 1788aa 

MDSGICYIKPEYLVTEADGGSAAANTENSDTNKRKREDGGEVEAGEKKKWDKKER^GQN 
KTrapVFKDERYSHLCHSLIIX3TGGEPCSLANC 

Sg^SSa^tdeSr^kredydenappttcngvssaasstlhnasmqmnpltnm 
SSSehel^^S^kdsawifvagfpytltegdlvcvfsqygevvninlir 

dsiSgkskhsplyrgeilfripelsqipdpij^ 

sSaqqsmk^tfi^spfrsgkksidkntseqqraiselvstdhmlhlqqllqqqr 
mshwptSnyvlfnpgpvpsrhvqykirkprplsthsdadsgflspcspeemranp 

PDLVKKPKNQQQMFDNVNSCLHVLRSQS^ 

oa^otS^gcgggvggssstltgsgsvlgigigglrtpgsslnqdknqqeqqqqqqqq 

TPSIPGLGKSGSDFNTSRPNSPPTSNHTIQSLKSGNNNSLRPPSIKSGIPSPSSPQTAPQ 

SqX™svssqkp^ 

SRSSTMGRTGKSSLVRAVGGVEKNTPKTSSKSSLHSKSDSKSSLKAPQLLQSPSSGGLPK 

hq^tSppyyansqptshisshgflsepstpqhssgiygssrlpppksalsaprkleyn 
agphilLpthhqrqglprplvnsapot^^ 

pasggssilpmrpllrgynshvtlptrgargghhphqsyldfcesdigqgycsdgdalrv 
gsspggsrfhdidngylsegssglngpsssaggispgkhflsmmrartqlpttieerqli 

YGAS VP I L.TLLPDRKI YQNNVRQ I KVDKLAAMAERWNMELGNGGAKMDGS PHHRPGSRNG 

RANGSIASTAEQQNIAMMMAAGGAGANGLPCGRTAHVSAVPRTASGRKVAGGTQTLPNDM 
NKLPPNTQHRSFSLTGPTATQLSQSIRERLATGSHSLPKPGSDLHVFQHRISNRGGTRHD 
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GSLSDTQTYAEVKPEYSSYAMWLKHSNTAGSRLSDGESVEQLQIGSPALTRHGHKMIHNR 
SGGPGQMAGQMSGNESPYVQSPRMNRSNSIRSTKSEKMYPSMMSRAGEVEIEPYYCLPVG 
TNGVLTAQMAAAMAAQSQAAQGNPGVGVNVGGVAWSQPTSPTPLTRGPE^NTAAGASVLSP 
THGTTSAAGLVGPGGGAGGGAMVGHRLTYPKKNDEVHGSAASLLSGGSSLYGNAEERQAH 
E I RRLKRELQDARDQVLSLS SQL STNVSKKC PVWFQMYTLRMARSRR * 
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Figure 11c: Illustration of a ' BLAST 2 sequences' search 
result with Hs-unc-53/3 as query and the Fgene predicted 
UNC53 homology ORF of Drosophila melanogaster BAC clone 
BACR48M05 as subject 

Query: Hs-unc-53/3 (direct) 2120aa Length 2119 from:l to = 2119 
Subject: drosUNC53 (Fgene -predict ion) Length 1788 from:l to - 
1788 

Score = 106 bits (261), Expect = 2e-21 

Identities = 190/840 (22%), Positives = 294/840 (34%), Gaps = 185/840 
(22%) 

Query: 1 IYTDWANHYLAKSGHKRLIKDLQQDIATCVLLAEIIQIIANEKVEDINGCPRSQSQMI^ 60 
^ y TYTDWAN+YL ++ KR + DL D DG+LLAE+I+ + ♦ KV D+ P++Q QM +N 

sbjct-. 497 iSdw^lerakskrkvtdlsaixrdglllaevieavtsfkvpdl^pknqqqmfdn 556 

Ouerv- 61 VDVCLSFLAARGV-NVQGLSAEEIRNGNLKAILGLFFSLSRYK "2 

V+ CL L ++ V ++ ++ +1 G LKA+L LFF+LSR+K 
Sbjct: 557 VNSCLHVLRSQSVGGLENITTNDICAGRLKAVLALFFALSRFKQQAKQTKSIGVGCGGGV 616 

Query- 103 XXXXXXXXXXXSLVEL qqrvtHASPPSEASQAKTQQDMQSSLAARYATQSNHSG 156 

S++ + r +S + +Q + QQ Q + QS +G 

Sbjct: 617 GGSSSTLTGSGSVLGIGIGGLRTPGSSLNQDKNQQEQQQQQQQQQTPQQLAQSLENGNEM 67 6 

Query: 157 IATSQKK PTRLPGPSRV PAAGSSSKVQGASNLNRRSQSFNS 197 

Sbjct: 677 WRQ^PAYA^GG^LpATVMVQRRCPPDKVRPLPPTPNHTPSIPGLGKSGSDFNT 736 
Query- 198 IDKNKPPNYANGNEKDSSKGPQS-SSGVNGNVQPPSTAGQXXXXXXXXXXXXKPWRSKSM 256 

N PP S+ QS SG N +++PPS 
Sbjct: 737 SRPNSPPT SNHTIQSLKSGNNNSLRPPSIKSGI 

Query: 257 NVKHSATSTMLTVKQXXXXXXXXXXXDRLKPPVSEGVKT^^^ 316 

„_ A PSPSSPQTAPQ-KHSMLDKLKLFNKEKQQ 797 

Sbjct: 770 

Query: 317 RXXXXXXXXXXXXXXXXXAFSESGEMEGFXXXXXXXXXXXXXPKVSPK^PPKAGSKNLS 376 

S SG + ++ £> 

Sbjct: 798 NAVNAASVASKTQIQSKRTSSSSGFSS-ARSERSDSSLSLNDGHGSQLKPP— SISVS 852 

Query- 377 NKKSLLQPXXXXXXNRDKNKVCTEKPVKEEKDQVTEMAPKKTSKIASLIPKGSKTTAAKK 436 

++K QP ++K+ + KE+ ++ T++ K-f S SL + S + + 

Sbjct: 853 SQKP-QP KTKQSKLLAAQQKKEQANKATKLDKKEKSPARSLNKEESGNES--R 902 

Query- 437 ESLXXXXXXXXXXXXXXXTVKQTISPGSTASKESEKFRTTKGSPSQSLSKPITMEKASAS 496 

c K T S +S S K SL P ++ S+ 

Sbjct: 903 SSTMGRTGKSSLVRAVGGVEKNTPKTSSKSSLHS KSDSKSSLKAPQLLQSPSSG 956 

Query: 497 SCPAPLEGREAGQASPSGSCTMTVAQSSGQSTGNGAVQLP ^ HSHPN J A ™ A ~ 55 ° 

Sbjct: 957 GLPKPIAAI KGTSKLP LgGgLhLPAAESQQNQQLLKRETSDISS 1002 

n„ e r- v - 551 .__ DF iyRAHSENEGTALPSADSCTSPTKMDLSYSKTAKQCLEEISGEGPETR 600 

yuery. = = x p AH t p + + pt S+ ++ + S 

Sbjct: 1003 NISQPPPAEPPISTHAHIHQNQTPPPPYYANSQPTSHISSHGFLSEPSTPQHSSGIYGSS 1062 

Query: 601 RMRTVKNIADLRQNLEETMSSLRGTQISHSTLETTFDSTVTTEVNGRTI-PN-LTSRPTP 658 

R+ K+ + LE * *H * V + NT PN + P+ 

Sbjct: 1063 RLPPPKSALSAPRKLEYNAGPHILSSPTHHQRQGLPRPLVNSAPNTPTASPNKFHTIPSK 1122 

Query: 659 MTWRLGQACPRLQAGDAPSLGAGYPRSGTSRFIHTDPSRFMY-— TTPLRRAAVSRLGN 714 

+ + + ++LAPSGS*P Y TPRA 

Sbjct: 1123 IVGTI YESKEEQLLPAPPPASGGSSILPMRPLLRGYNSHVTLPTRGARGGHHPH 1176 

Query 715 MSQIDMSEKASSDLDMSSEVDVG-GYMSDGDIL--GKS LRTDDINSGYMTDG--GLN 766 

Query. /ia nsgiu ^ ^ SDGD L G S R DI++GY+++G GLN 

Sbjct: 1177 QSYLDFCES DIGQGYCSDGDALRVGSSPGGSRFHDIDNGYLSEGSSGLN 1225 
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Figure 12 : Illustration of an EST encoding a part of the 
Zebraf ish-UNC-53/2 cDNA . 



Sbjct= emb|AI658309|AI658309 fc21d06.yl Zebraf ish WashU MP IMG EST Danio 

rerio cDNA 5* similar to TR:Q20427 Q20427 F45E10.1 mRNA sequence. Length = 445 

Score = 277 bits (702), Expect = 4e-73 

Identities = 124/147 (84%) , Positives = 136/147 (92%) 

Frame = +3 

Query: 2121 LHHNFRWVLCANHTEPVKGFLGRFLRRKLMETEI SGRVRNMELVKI I DWI PKVWHHLNRF 2180 

LHHNFRW+LCANHTEPVKGFLGRFLRRKL+ETEI + RVRN ELVKII+WIP VWHHLNRF 
Sbjct: 3 LHHNFRWILCANHTEPVKGFLGRFLRRKLLETEINSRVRNGELVKIIEWIPSVWHHLNRF 182 

Query: 2181 LEAHSSSDVTIGPRLFLSCPIDVX>GSRVWFTDLWNYSIIPyLLEAVREGLQLYGRRAPWE 2240 

LE HSSSDVTIGPRLFLSCP+DV+GSRVWFTDLWNYSIIPY+LEAVREGLQ+YGR+A WE 
Sb j ct : 183 LETHS S SDVT T G PRIjFIj SC PMDVEG S R VWFTDLWNYS I X P YML EA VREG L QMYG RXA S WE 362 

Query: 2241 DPAKWVMDTYPWAASPQQHEWPPLLQL 2267 

DPAKWVM++ ASPQQHEW LL+L 
Sbjct: 3 63 DPAKWVMESLLCVASPQQHEWHSIXRL 443 



Query= hh2UNC53 



(2340 letters) 
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Figure 13. Genemap98 results for Hs-Unc53/2 



UniGene |Hs. 13830 1 1 


RH MaDDine Results _ 


SHGC- 
33456 


G3 Map: 


Chr.ll 


Reference interval: 


ni 1S921-D1 1S1359 (24.9-32.5 cM) 


Physical position: 


911 cR10000(F) 


RH details: 


RHdb RH32790 


Tvoed bv: 


Stanford (see SHGC-33456) 


Electronic 1 


PCR Results — 


RSTs (from GenBank EST division) — 


AA1 15015 


zl04dl0 si Soares pregnant uterus NbHPU Homo sapiens cDNA clone 491347 3 




sts 7 ... 134 bp: SHGC-33456 

o!53ell si Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:1527212 3 


AA918601 


STS |16... 1143 1 bp: | SHGC-33456 


AI248585 


qh71f08.xl Soares_fetalJiver_spleen_lNFLS_Sl Homo sapiens cDNA clone 
IMAGE:1850151 3', mRNA sequence fHomo sapiens] 




STS 19 ... |146 |bo: SHGC-33456 1 


T71262 


yd35b09.sl Homo sapiens cDNA clone 1 10201 3'. 




STS |9|... 1136 Ibp: | SHGC-33456 1 



RH Map Genetic Gene Cytogenetic 
GB4 G3 Map Density Ideogram 




The thick line on the G3 map indicates the position o 
SHGC-33456 See also: equivalent interval on GB4 map 



About This Interval 


Top of interval: 


D11S921 (24.9 CM) 


Bottom of interval : 


D11S1359 (32.5 cM) 


Genetic size of bin: 


8 CM 


Physical size of bin: 


430 CR10000 
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Figure 15: Illustration of the nucleotide sequence of 
PGI3150 and amino acid sequence of the eGFP fusion with a 
C- terminal fragment of Hs-Unc-53/1- 



Hu-unc-53/1 




KanNeo 



eGFP 



ID 



PGI3150 circular DNA; 7655 BP 

DE from coiled coil I till end 

FT CDS 1225. .2019 

FT /vntifkey= tt 4 M 
ft /label=KanNeo 

FT CDS 3942.. 4658 

FT /vntifkey="4" 
ft /label=eGFP 

FT CDS 4719. .7214 

FT /vntifkey="4" 
FT /label=Hu-unc-53/l 

SQ SEQUENCE 7 655 BP; 

CTAGATAACT GATCATAATC AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA 60 

ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT 120 

TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA 180 

AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTAAC 240 

GCGTAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT TAAATTTTTG TTAAATCAGC 3 00 

TCATTTTTTA ACCAATAGGC CGAAATCGGC AAAATCCCTT ATAAATCAAA AGAATAGACC 360 

GAGATAGGGT TGAGTGTTGT TCCAGTTTGG AACAAGAGTC CACTATTAAA GAACGTGGAC 420 

T C C AACGTC A AAGGGCGAAA AACCGTCTAT CAGGGCGATG GCCCACTACG TGAACCATCA 480 

CCCTAATCAA GTTTTTTGGG GTCGAGGTGC CGTAAAGCAC TAAATCGGAA CCCTAAAGGG 540 

AGCCCCCGAT TTAGAGCTTG ACGGGGAAAG CCGGCGAACG TGGCGAGAAA GGAAGGGAAG 600 

AAAGCGAAAG GAGCGGGCGC TAGGGCGCTG GCAAGTGTAG CGGTCACGCT GCGCGTAACC 660 

ACCACACCCG CCGCGCTTAA TGCGCCGCTA CAGGGCGCGT CAGGTGGCAC TTTTCGGGGA 720 

AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC 780 

ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TCCTGAGGCG 840 

GAAAGAACCA GCTGTGGAAT GTGTGTCAGT TAGGGTGTGG AAAGTCCCCA GGCTCCCCAG 9 00 

CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC AACCAGGTGT GGAAAGTCCC 9 60 

CAGGCTCCCC AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCATAG 1020 

TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC 1080 

CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC 1140 

TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG CTTTTGCAAA GATCGATCAA 1200 

GAGACAGGAT GAGGATCGTT TCGCATGATT GAACAAGATG GATTGCACGC AGGTTCTCCG 12 60 

GCCGCTTGGG TGGAGAGGCT ATTCGGCTAT GACTGGGCAC AACAGACAAT CGGCTGCTCT 1320 

GATGCCGCCG TGTTCCGGCT GTCAGCGCAG GGGCGCCCGG TTCTTTTTGT CAAGACCGAC 13 80 

CTGTCCGGTG CCCTGAATGA ACTGCAAGAC GAGGCAGCGC GGCTATCGTG GCTGGCCACG 1440 

ACGGGCGTTC CTTGCGCAGC TGTGCTCGAC GTTGTCACTG AAGCGGGAAG GGACTGGCTG 1500 
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CTATTGGGCG AAGTGCCGGG GCAGGATCTC CTGTCATCTC ACCTTGCTCC TGCCGAGAAA 1560 

GTATCCATCA TGGCTGATGC AATGCGGCGG CTGCATACGC TTGATCCGGC TACCTGCCCA 1620 

TTCGACCACC AAGCGAAACA TCGCATCGAG CGAGCACGTA CTCGGATGGA AGCCGGTCTT 1680 

GTCGATCAGG ATGATCTGGA CGAAGAGCAT CAGGGGCTCG CGCCAGCCGA ACTGTTCGCC 1740 

AGGCTCAAGG CGAGCATGCC CGACGGCGAG GATCTCGTCG TGACCCATGG CGATGCCTGC 1800 

TTGCCGAATA TCATGGTGGA AAATGGCCGC TTTTCTGGAT TCATCGACTG TGGCCGGCTG 1860 

GGTGTGGCGG ACCGCTATCA GGACATAGCG TTGGCTACCC GTGATATTGC TGAAGAGCTT 1920 

GGCGGCGAAT GGGCTGACCG CTTCCTCGTG CTTTACGGTA TCGCCGCTCC CGATTCGCAG 1980 

CGCATCGCCT TCTATCGCCT TCTTGACGAG TTCTTCTGAG CGGGACTCTG GGGTTCGAAA 2040 

TGACCGACCA AGCGACGCCC AACCTGCCAT CACGAGATTT CGATTCCACC GCCGCCTTCT 2100 

ATGAAAGGTT GGGCTTCGGA ATCGTTTTCC GGGACGCCGG CTGGATGATC CTCCAGCGCG 2160 

GGGATCTCAT GCTGGAGTTC TTCGCCCACC CTAGGGGGAG GCTAACTGAA ACACGGAAGG 2220 

AGACAATACC GGAAGGAACC CGCGCTATGA CGGCAATAAA AAGACAGAAT AAAACGCACG 2280 

GTGTTGGGTC GTTTGTTCAT AAACGCGGGG TTCGGTCCCA GGGCTGGCAC TCTGTCGATA 2340 

CCCCACCGAG AC CCCATTGG GGCCAATACG CCCGCGTTTC TTCCTTTTCC CCACCCCACC 2400 

CCCCAAGTTC GGGTGAAGGC CCAGGGCTCG CAGCCAACGT CGGGGCGGCA GGCCCTGCCA 2460 

TAG CCTCAGG TTACTCATAT ATACTTTAGA TTGATTTAAA ACTTCATTTT TAATTTAAAA 2520 

GGATCTAGGT GAAGATCCTT TTTGATAATC TCATGACCAA AATCCCTTAA CGTGAGTTTT 2580 

CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT 2640 

TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT 2700 

TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA 2760 

TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG 2820 

CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGC«ATA 20*1 

AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG 2940 

GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA 3000 

GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA 3060 

GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA 3120 

ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT 3180 

TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC 3 240 

GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT 3300 

CTGTGGATAA CCGTATTACC GCCATGCATT AGTTATTAAT AGTAATCAAT TACGGGGTCA 3360 

TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA TGGCCCGCCT 3420 

GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA 3480 

ACGC CAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC 3540 

TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 3600 

AAATGGCCCG CCTGGCATTA TGCC CAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG 3 660 

TACATCTACG TATTAGTCAT CG CTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 3720 

GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT 3780 

GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC 3840 

CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTGGT 3900 

TTAGTGAACC GTCAGATCCG CTAGCGCTAC CGGTCGCCAC CATGGTGAGC AAGGGCGAGG 3 960 

AGCTGTTCAC CGGGGTGGTG CCCATCCTGG TCGAGCTGGA CGGCGACGTA AACGGCCACA 4020 

AGTTCAGCGT GTCCGGCGAG GGCGAGGGCG ATGCCACCTA CGGCAAGCTG ACCCTGAAGT 4080 

TCATCTGCAC CACCGGCAAG CTGCCCGTGC CCTGGCCCAC CCTCGTGACC ACCCTGACCT 4140 

ACGGCGTGCA GTGCTTCAGC CGCTACCCCG ACCACATGAA GCAGCACGAC TTCTTCAAGT 4200 

CCGCCATGCC CGAAGGCTAC GTCCAGGAGC GCACCATCTT CTTCAAGGAC GACGGCAACT 4260 

ACAAGACCCG CGCCGAGGTG AAGTTCGAGG GCGACACCCT GGTGAACCGC ATCGAGCTGA 4320 

AGGGCATCGA CTTCAAGGAG GACGGCAACA TCCTGGGGCA CAAG CTGG AG TACAACTACA 4 380 

ACAGCCACAA CGTCTATATC ATGGCCGACA AGCAGAAGAA CGGCATCAAG GTGAACTTCA 4440 

AGATCCGCCA CAACATCGAG GACGGCAGCG TGCAGCTCGC CGACCACTAC CAGCAGAACA 4500 

CCCCCATCGG CGACGGCCCC GTGCTGCTGC CCGACAACCA CTACCTGAGC ACCCAGTCCG 4560 

CCCTGAGCAA AGACCCCAAC GAGAAGCGCG ATCACATGGT CCTGCTGGAG TTCGTGACCG 4620 

CCGCCGGGAT CACTCTCGGC ATGGACGAGC TGTACAAGTC CGGACTCAGA TCTCGAGCTC 4680 

AAGCTTCGAA TTCTGCAGTC GACGGTACCG CGGGCCCGGG ATCCTTCCGA GACCCCACGG 4740 

ACGATGTTCA CGGCTCAGTG CTGTCCCTGG CCTCCAGTGC CTCCTCCACC TACTCCTCAG 4800 

CTGAGGAGAG GATGCAATCT GAGCAAATCC GGAAGCTTCG TAGGGAACTG GAATCATCCC 4860 

AGGAAAAAGT GGCCACCTTG ACGTCTCAGC TTTCTGCCAA TGCTAATCTG GTGGCTGCTT 4920 

TTGAGCAGAG CCTGGTGAAT ATGACATCCC GCCTGCGACA CCTGGCAGAG ACGGCCGAGG 4980 

AGAAGGACAC TGAGCTGCTG GATTTGCGAG AAACCATAGA CTTTCTGAAG AAAAAGAACT 5040 

CTGAGGCCCA GGCAGTCATT CAGGGAGCCC TTAATGCCTC AGAAACCACA CCCAAAGAAC 5100 

TTCGGATCAA GAGACAAAAC TCCTCAGATA GCATCTCAAG CCTCAACAGC ATCACTAGCC 5160 

ATTCCAGCAT CGGCAGCAGC AAGGATGCTG ATGCGAAAAA GAAGAAAAAA AAGAGTTGGG 5220 

TCTATGAGCT TCGAAGTTCC TTCAACAAAG CGTTCAGTAT AAAAAAGGGG CCCAAGTCAG 5280 

CTTCCTCATA CTCGGATATA GAGGAGATTG CTACACCCGA CTCTTCAGCC CCCTCATCCC 5340 

CCAAACTACA GCATGGTTCC ACAGAGACTG CTTCACCCTC CATCAAGTCC TCCACCTTGT 5400 

CCTCCGTGGG CACTGATGTC ACCGAGGGCC CTGCTCACCC AGCCCCCCAC ACTAGGCTGT 5460 

TCCATG C AAA TGAGGAGGAG GAGCCAGAGA AGAAGGAGGT ATCGGAGCTG CGCTCTGAGC 5520 

TATGGGAGAA GGAAATGAAG CTTACAGACA TCCGCTTGGA GGCCCTCAAC TCTGCCCACC 5580 

AACTGGATCA GCTTCGGGAG ACCATGCACA ACATGCAGTT GGAGGTGGAC CTGCTGAAAG 5640 

CAGAGAATGA CCGACTGAAG GTAGCCCCAG GCCCCTCATC AGGCTCCACT CCAGGGCAGG 5700 

TCCCTGGATC ATCTGCATTA TCTTCCCCAC GCCGCTCCCT AGGCCTGGCA CTCACCCATT 57 60 

CCTTCGGCCC CAGTCTTGCA GACACAGACC TGTCACCCAT GGATGGCATC AGTACTTGTG 5820 
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GTCCAAAGGA 
AAGGGGACTT 
ACTGGAAGAT 
ACCCAGCCTC 
TGAAACGAGT 
ACATATCAGT 
CGCTGATCCC 
TCGTCCTCTC 
ACCTGGTGGA 
ACCAGCAGTC 
GGGAAACAGG 
GCTCCATCAG 
TTATAGGTAC 
TCAGGATGTT 
TGAGGAGGAA 
GGGTGCTCGA 
GCACCTCAGA 
ACTTCCGGAC 
GAGCCAAGGA 
GGGTCCGGGA 
TGCCCCCACC 
AAGACAGCAC 
AAGAAGCTGC 
AGGCAACACT 
AGCTATCTTA 
GAGGAGAACA 
TTGAGAACTT 
CATTTACTGG 
TTCTTGTTTC 
ACTGCAGCAG 
GCCGAATTCC 



GGAAGTGACC 
GAAGCAGCAG 
GCTGGATGAA 
TACCCTGGGA 
GTTGGATGCA 
CTCCCTCAAA 
CAAGCCGATG 
GGGCCCCAGC 
GCGCTCTGGC 
TTGCAAGGAT 
AATTGGGGAT 
TGAGTTGGTC 
CACCAATCAG 
GACCTTCTCC 
GCTGGTAGAG 
CTGGGTACCC 
CTTCCTCATC 
CTGGTTCATT 
TGGGATAAAG 
CACACTTCCC 
CACCGTGGGC 
CCCAAGTTCT 
CAACTACATT 
TTAAGGGTTC 
GCTCCTCCTC 
GGAGGGAGGA 
CCTAGGAAGG 
CCTCCTCTAA 
AATTACAAAC 
TTCCCCGGAA 
AGCACACTGG 



CTCCGGGTGG 
GAATTCTTCC 
GCTGTTTTCC 
CTAAGCACTG 
GAGCCCCCCG 
GGTCTGAAGG 
ATGCAGCACT 
GGCACGGGCA 
CGTGAGGTCA 
CTGCAACTGT 
GTGCCCCTGG 
AATGGGGCCC 
CCTGTAAAAA 
AACAACGTGG 
TCAGACAGCG 
AAGCTGTGGT 
GGCCCTTGCT 
GACCTGTGGA 
GTCCATGGAC 
TGGCCATCAG 
CCTCACAGCA 
CTGGACTCAG 
GAGTCTCCAG 
GGCAATCACT 
TCCCCTCTCC 
GGAGATGAAA 
AATGGTGGGG 
TGACTTTGGG 
TCCTGGGCTT 
TTCAGCTTGG 
CGGCCGTTAC 



TGGTGAGGAT 
TGGGCTGTAG 
AAGTGTTCAA 
AGTCCATCCA 
AGATGCCTCC 
AGAAATGCGT 
ACATAAGCCT 
AGACCTACCT 
CAGAGGGCAT 
ATCTTTCCAA 
TGATTCTATT 
TCACCTGCAA 
TGACACCCAA 
AGCCAGCCAA 
ACATCAATGC 
ATCATCTCCA 
TCTTTCTGTC 
ACAACTCTAT 
AGAAAGCTGC 
C CCAACAAG A 
TTGCCTCACC 
ATCCTCTGAT 
ATCGAGAAAC 
GTCACCCCCG 
TCTTTCAGAG 
GAGGAGGGAC 
TGGCGTTTGG 
GAAAAGATGA 
TCTGGGGAGG 
ACTTAACCAG 
TAGTT 



GCCCCCGCAG 
CAAGGTCAGT 
GGACTATATT 
TGGCTACAGC 
TTGCCGTCGA 
CGACAGCCTG 
CCTGCTGAAG 
GACCAATCGC 
CGTCAGCACC 
CCTAGCCAAC 
GGATGACCTG 
GTATCATAAA 
CCATGGCTTG 
TGGCTTCCTG 
CAACAAGGAA 
CACCTTCCTT 
GTGTCCCATT 
CATTCCCTAT 
TTGGGAGGAC 
CCAATCAAAG 
TCCCGAGGAT 
GGCCATGCTG 
CATCCTGGAC 
GACAGCAGAA 
CACTGGCTCT 
AGGTTCTTGG 
GAACTTGTGC 
TTCTGGGTCT 
GGTTCAGAAA 
GCTGAACTTG 



CACATCATCA 
GGAAAAGTTG 
TCTAAAATGG 
ATCAGCCACG 
GGTGTCAATA 
GTGTTCGAGA 
CACCGGCGCC 
TTGGCCGAGT 
TTCAACATGC 
CAGATAGACC 
AGTGAAGCAG 
TGTCCCTATA 
CACTTGAGCT 
GTTCGTTACC 
GAGCTGCTTC 
GAGAAGCACA 
GGCATTGAGG 
CTACAGGAAG 
CCAGTGGAAT 
CTGTACCACC 
AGGACAGTCA 
CTGAAACTTC 
CCCAACCTTC 
CGCTGGCATC 
CCAGCCCCAG 
TGCTGTACCT 
CCCCTAAACA 
TTCCCTTGAC 
ACATCAAAAC 
CTCAAAAGAA 
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Figure 16: EST Clone yk480b6 contains a splice variant of 
Ce-UNC-53 
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Results of SIM with: 

Sequence 1 : Ce-unc-53 f (1583 residues) Sequence 2: yk480b06rc, (1556 residues) 

C e -UNC - 5 3 110 LSTYKQKL RQLKKDQKKLEQLPTSIMPPAVSKLPS PRVATS ATASATNPNSNFPQMSTSR 

yk4 8 ObO 6 rc 5 I |QEFGTI^ RQLKKDQKKLEQLPTSIMPPAVSKLPSPRVATSATASATNPNSNF PQMSTSR 

Ce-UNC-53 170 LQTPQSRISKIDSSKIGIKPKTSGLKPPSSSTTSSNNTNSFRPSSRSSGNNNVGSTISTS 

yk480b06rc 65 LQTPQSRISKIDSSKIGIKPKTSGLKPPSSSTTSSNNTNSFRPSSRSSGNNNVGSTISTS 

************************************************************ 

Ce-UNC-53 230 AKSLESSSTYSSISNXjNRPTSQLQKPSRPQTQLVRVATTTKIGSSKLAAPKAVSTPKLAS 

yk480b06rc 125 AKSLESSSTYSSISNliNRPTSQLQKPSRPQTQLVRVATTTKIGSSKLAAPKAVSTPKLAS 
************************************************************ 

Ce-UNC-53 290 WTIGAKQEPDNSGGGGGGMLKLKLFSSKNPSSSSNSPQPTRKAAAVPQQQTLSKIAAPV 

yk48 0b06rc 185 VKT IGAKQEPDNSGGGGGGMLKLKLFS S KNPSS S SNS PQPTRKAAAVPQQQTLSKI AAPV 
************************************************************ 

Ce-UNC-53 3 50 KSGLKPPTSKLGSATSMSKLCTPKVSYRKTDAPIISQQDSKRCSKSSEEESGYAGFNSTS 

yk480b06rc 245 KSGLKPPTSKLGSATSMSKLCTPKVSYRKTDAPIISQQDSKRCSKSSEEESGYAGFNSTS 
************************************************************ 

Ce-UNC-53 410 PTSSSTEGSLSMHSTSSKSSTSDEKSPSSDDLTLNASIVTAIRQPIAATPVSPNIINKPV 

yk480b06rc 305 PTS SSTEGSLSMHSTSSKS STSDEKSPS SDDLTLNAS IVTAIRQPIAATPVSPNI INKPV 
it*********************************************************** 

Ce-UNC-53 470 EEKPTI^VKGVKSTAKKDPPPAVPPRDTQPTIGWSPIMAHKKLTNDPVISEKPEPEKLQ 

yk48 0b06rc 3 65 EEKPTLAVKGVKSTAKKDPPPAVPPRDTQPTIGWSPIMAHKKLTNDPVISEKPEPEKLQ 
************************************************************ 

Ce-UNC-53 530 SMSIDTTDVPPLPPLKSWPLKMTSIRQPPTYDVLLKQGKITSPVKSFGYEQSSASEDSI 

yk48 0b06rc 425 SMS IDTTDVPPLP PLKS WPLKMTS IRQ PPT YDVLLKQGKITS PVKSFGYEQS SASEDS I 
************************************************************ 

Ce -UNC - 5 3 590 VAHASAQWPPTKTSGNHSLERRMGKNKTSESSGYTSDAGVAMCAKMREKLKEYDDMTRR 

yk480b06rc 485 VAHASAQVTPPTKTSGNHSLERRMGKNKTSESSGYTSDAGVAMCAKMREKLKEYDDMTRR 
************************************************************ 

Ce -UNC - 5 3 650 AQNGYPDNFEDSS SLSSGI SDNNELDDI STDDLSGVDMATVASKHSDYSHFVRHPTSSSS 

yk480b06rc 545 AQNGYPDNFEDSS SLSSGI SDNNELDDI STDDLSGVDMATVASKHSDYSHFVRHPTSSSS 
************************************************************ 

Ce-UNC-53 710 KPRVPSRSSTSVDSRSRAEQENVYKLLSQCRTSQRGAAATSTFGQHSLRSPGYSSYSPHL 

yk480b06rc 6 05 KPRVPSRSSTSVDSRSRAEQENVYKLLSQCRTSQRGAAATSTFGQHSLRSPGYSSYSPHL 
*************************** ********************************* 

Ce -UNC - 5 3 770 SVS ADKDTMSMHSQTSRRPSSQKPSYSGQFHSLDRKCHLQEFTSTEHRMAALLSPRRVPN 

y k4 8 ObO 6rc 665 SVSADKDTMSMHSQTSRRPSSQKPS YSGQFHSLDRKCHLQEFTSTEHRMAALLSPRRVPN 
************************************************************ 
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Jk«0b06re 725 ^^ ^mli^iR^ILZESLipRPPR^QSPAD SCIITASPSAPRRSHSPRGFI 



r ttnc 53 838 GSYSARSRGGSSTGIYGETFQLHRLSDBKSPAHSAKSEMGS 

$ k 480b06rc 785 | TARIPLS^SSPVHV^GSVSAR^ 



Ce-UNC-53 879 QLSIASTTAYGSI^KyEHAIRDMARDLECYKNTVDSLTKKQENYGALFDLFEQKLRKLT 
Vk480b06rc 845 QLSIiASTTAYGSLNEKYEHAIRDMARDLECYKNTVDSLTKKQENYGALFDLFEQK^KLT 
yK * ou ♦* .,..*********************************************** 




Ce-UNC-53 999 SSSSKSSKQEKISLSSFGKNKKSW-^— IRSSLSKFTKKKNKNYDEAHMPSISGSQG 

%Z>06rc 965 SSSSKSSKQEKIS^SS^ 

Ce-UNC-53 1052 TLDNIDVIELKQELKERDSALYEWLDNliRAREVDVI^ETVNKLKTENKQLKKEVDKIiT 

^480b06rc 1025 T^DVIEIJBEL^^ 

Ce-UNC-53 1112 NGPATRASSRASIPVIYDDEHVYDAACSSTSASQSSKRSSGCNSIKVTVN^IAGEISSI 

Vk480b06rc 1085 NGPATRASSRASIPVIYDDEHVYDAACSSTSASQSSKRSSGCNSIKV^^IAGEISSI 
yK .***.**********.**************************** ******* ********* 

o-UNC-53 1172 VNPDKEI IVGYLAMSTSQSCWKDIDVSI LGLFEVYLSRIDVEHQLGIDARDS I LGYQIGE 

yk480b06rc 1145 VNPDKEI ^^^^^^^^^^^^^^^f^^^ ^ ♦^^T^t^I'^^^ ^^T^^^^r^*^^^^^ *^r^"*? 

UNC-53 1232 LRRVIGDSTTMITSHPTDILTSSTTIRMFMHGAAQSRVDSLVLDMLLPKQMILQLVKSIL 

yk480b06rc 1205 LRRVIGDSTTMITSHPTDILTSSTTIRMFMHGAAQSRVDSLVLDMLLPKQMIL^ 
y *********************************************************** 

Ce-UNC-53 1292 TERRLVLAGATGIGKSKLAKTLAAYVS I RTNQSEDSI VNI S I PENNKEELLQVERRLEKI 

yM80b06rc 1265 TERRLVLAGATGIGKSKLAKTIJ^^ 

y *********************************************************** 

Ce-UNC-53 13 52 LRSKESCIVILDNIPKNRIAFWSVFANVPLQNNEGPFVVCTVNRYQIPELQIHHNF^S 

vk48 0b06rc 13 25 LRSKESCIVILDNIPKNRIAFWSWANVPLQNNEGPFWCTVNRYQIPELQIHHNF^ 

y *********************************************************** 

Ce-UNC-53 1412 VMSNRLEGF ILRYLRRRAVEDEYRLTVQMPSELFKI I DFFP I ALQAVNNFI EKTNSVDVT 

yk480b06rc 13 85 VMSNRLEGFILRYLRRRAVEDEYRLTVQMPSELFKIIDFFPIALQAVNOT 

************************************************************ 

Ce-UNC-53 1472 VGPRACLNCPLTVIXSSREWFIRLWNENFIPYLERVARDGKKTFGRCTSFEDPTDIV^IW 

Vk480b06rc 1445 VGPRACLNCPLTVDGSREWFIRLWNENFIPYLERVARTCKKTFGRCTSFEDPTDIV^KW 

yk^soDUbrc ;^; # ^^^^^^ w ^****************************************** ** 

Ce-UNC-53 1532 PWFDGENPENVLKRLQLQDLVPSPANSSRQHFNPLESLIQLHATKHQTIDNI 

vk480b06rc 1505 PWFDGENPENVLKRL.QLQDLVPSPANSSRQHFNPLESLIQLHATKHQTIDNI 
y **************************************************** 

Legend: the alternative splices and the mutation (S-P) are 
indicated in red and are boxed. 
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